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METHOD S_OF_GENERA^ 

RESIDUES_IN_PROTEINS 

gg£iS&g££B^-£f -£*l£_l n v entio11 

In both bacterial and eukaryotic cells, 

05 relatively long-lived proteins, whose half-lives are 
close to or exceed the, cell generation time, coexist 
with proteins whose half-lives can be less than one 
percent of the cell generation time. Rates of 
intracellular protein degradation are a function of 

10 the cell's physiological state, and appear to be 

controlled differentially for individual proteins. 
In particular, damaged and otherwise abnormal 
proteins are me tabol ically unstable in vivo. 
Although the specific functions of selective protein 

15 degradation are in most cases still unknown, it is 
clear that many regulatory proteins are extremely 
short-lived in vivo. Metabolic instability of such 
proteins allows for rapid adjustment of their 
intracellular concentrations through regulated 

20 changes in rates of their synthesis or degradation. 
The few instances in which the metabolic instability 
of an intracellular protein has been shown to be 
essential for its function include the ell protein 
of bacteriophage lambda and the HO endonuclease of 

25 the yeast Sac char omyces c e r e v i si a e . 

Most of the selective turnover of intracellular 
proteins under normal metabolic conditions is 
ATT -de-pendent and (In eukaryotes) nonlysosomal . 
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Recent biochemical and genetic evidence indicates 
that, in eukaryotes, covalent conjugation of ubi- 
quitin to short-lived intracellular proteins is 
essential for their selective degradation. The 
05 rules which determine whether a given protein is 
metabolically stable or unstable in vivo were 
previously unknown . 

S ummary of t he ^Invention 

This invention pertains to methods of engineer- 

10 ing the amino - terminus of proteins thereby con- 
trolling the metabolic stability and other proper- 
ties of a protein. Further, this invention provides 
a method for either in vivo or in vitr o production 
of proteins with any of the twenty amino acid 

15 residues (or analogs thereof) at the protein's 

amino - terminus . The invention is based in part upon 
the striking discovery that the in vivo half- life of 
an intracellular protein is a function of its 
amino - terminal amino acid residue and upon a novel 

20 (and more generally applicable) technique that 
allows one to generate proteins with specified 
amino -termini in vivo or in vitro . The invention 
also pertains to a newly identified protease, 
ubiquitin-specif ic processing protease, which has 

25 properties that allow one to expose, either in vitro 
or in vivo, any desired amino acid residue, other 
than proline, at the amino - terminus of a protein of 
Interest. 
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The nature of the amino acid exposed at the 
amino -terminus of an intracellular protein was shown 
to be one crucial determinant that specifies whether 
a protein will be long- or short-lived in vivo. 
Individual amino acids can be categorized as either 
stabilizing or destabilizing amino acids with 
respect to the half-life that they confer upon a 
protein when exposed at the protein's amino- 
terminus. Destabilizing amino acid residues confer 
short half-lives, down to a few minutes for some of 
the destabilizing amino acids. Stabilizing amino 
acid residues confer long half-lives of many hours. 
This striking and newly discovered dependency of a 
protein's half-life on its amino - terminal residue is 
15 referred to herein as the N-end rule. 

For some proteins, the presence of a destabili- 
zing amino acid at the amino terminus is necessary 
but not sufficient for destabilization . This is so 
because the complete amino - terminal degradation 
signal in a short-lived protein comprises two 
distinct determinants, each of which is necessary, 
but each of which, by itself, is insufficient for 
efficient destabilization of a protein. One 
determinant, described above, is the amino - terminal 
residue of the protein. The second determinant, 
described below, is a specific internal lysine 
residue. The ability of this critical lysine 
residue to serve as the second determinant is to a 
significant extent independent of the amino acid 



20 



25 
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sequences surrounding the residue. Instead, an 
essential feature of this critical lysine residue 
includes its spatial proximity to the protein's 
amino- terminus . 

05 Based upon the N-end rule, the amino - terminus 

of a protein can thus be designed or altered to 
change, the Intracellular half-life of the-, protein 
and in this way the lifetime and/or activity of the 
protein in vivo can be regulated. This capability 

10 can be exploited for rational protein design in many 
different contexts. Natural proteins can be 
modified to render them more or less resistant to 
degradation in vivo. The design or alteration of 
the protein can be done at the protein level or at 

15 the genetic (DNA) level. For example, proteins can 
be modified by chemically altering or engineering 
the amino -terminus to provide for exposure at the 
amino - terminus of an amino acid residue of the 
stabilizing or destabilizing class. At the genetic 

2q level, genes encoding proteins can be made to encode 
an amino acid of the desired class at the amino - 
terminus so that the expressed protein exhibits a 
predetermined amino -terminal structure which renders 
it either me tabolically stable or unstable with 

25 respect to the N-end rule pathway of proteolytic 
degradation. Amino - terminal regions can be 
engineered to provide appropriately located lysine 
residues In the context of a sufficiently seg- 
mentally mobile amino terminus to produce 
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destabilized protein. Furthermore, proteins can be 
expressed fused to a "masking" protein sequence 
which masks the engineered amino - terminus so that 
when unmasked the protein will exhibit the desired 

05 metabolic stability or other properties that depend 
on the nature of the protein's amino- terminal 
residue. In such constructs, for example, the 
junction between the two protein sequences can be 
designed to be cleaved specifically, for instance, 

10 by an endop r o teas e . Endoproteoly tic cleavage of the 
fused sequence unmasks the specifically engineered 
amino- terminus of the protein of interest and 
subjects the protein to degradation governed by the 
N'-end rule. One specific and new way to engineer 

15 the protein's amino - terminus is provided in this 
invention by the identification of ubiquitin- 
specific processing protease and determination of 
its substrate specificity. Using this protease, 
fusions of ubiquitin with other proteins can be 

20 specifically processed either in vitro or in vivo to 
generate proteins with desired amino - terminal 
r e s idues . 

A different, and also new way to specifically 
engineer short-lived proteins is provided in this 
25 invention by the discovery that ub iqui t in - pro t e in 

fusions, such as ub iqui t in- Pro - fi - galac to s idase , that 
cannot be efficiently deub iqui t inat ed , are 
metabolically unstable. Thus, by attaching the 
amino -terminal ubiquitin moiety -to a pxoxein in a 
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way that makes its removal either impossible or 
inefficient, one can destabilize proteins by a 
distinct technique that is not directly based on the 
N-end rule. 

05 In addition, variant cells can be developed 

which contain putative mutations in the "N-end" 
degrading p ro teas e ( s.) which either conditionally or 
noncohditionally stop degrading short-lived pro- 
teins. These cells can be used to overproduce 

10 proteins that ordinarily would be short-lived within 
the cell. 

Figure 1 shows the construction of ubiquitin- 
lacZ gene fusions. 
15 Figure 2 shows experiments in which the half- 

lives of engineered' £-gal proteins are directly 
measured. 

Figure 3 shows the changing of amino acid 
residues at the ub iqui tin-/3 - gal junction (A) using 
20 the newly discovered properties of ubiquitin- 

specific processing protease and the amino acid 
sequence in the vicinity of the junction (B) . 

Figure 4 shows the presence of multiple ubi- 
quitin moieties in me tabolically unstable 0-gal 
25 proteins. 

Figure 5 shows a series of /3-gal species 
containing ubiquitin in metabolically unstable /3-gal 
proteins- 
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Figure 6 shows that both prokaryotic and 
eukaryotic long-lived intracellular proteins have 
stabilizing amino acid residues at their amino- 
termini whereas secreted proteins exhibit a com- 
05 plementary bias. 

Figures 7 and 8 show the construction of 
ubiquitin fusions with mouse dihy dr o f o late 
reduc tas e . 

2£ tai l^ed_Des cr i£t ion_of _the_In v ent ion 

10 The elucidation of the N-end rule is described 

in detail below. Briefly, this rule governing 
protein degradation was revealed by examining the in 
vivo half-lives of the enzyme fi - galac tos idase having 
various amino acid residues at its amino - te rminus 

15 and produced as a fusion protein with ubiquitin. 

When a chimeric gene encoding a ub iqui t in - fi - galac - 
tosidase fusion protein is expressed in the yeast S. 
££££Xi!i££> ubiquitin is cleaved off the nascent 
fusion protein, yielding a deub iqui t ina ted- fi - galac - 

20 tosidase (£gal) . With one exception, this cleavage 
takes place efficiently regardless of the nature of 
the amino acid residue of 0gal at the ubiqui t in-^gal 
junction, thereby making it possible to expose 
selectively different residues at the amino- termini 

25 of otherwise identical figal proteins. The 0gal pro- 
teins so designed exhibited strikingly different 
half-lives in vivo, ranging from more than 20 hours 
to less than 3 minutes, depending upon the nature of 
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the amino acid at the amino - terminus of /Jgal. Amino 
acids can be thus ordered according to the the 
half-lives they confer on 0gal when present at its 
amino - terminus . For example,, the amino acids 

05 methionine, serine, alanine, threonine, valine, 

glycine and cysteine confer a half-life of more than 
20 hour-s . Pheny 1 alanine , leucine, asparagine, and. 
lysine yield half -lives of about three minutes. 
Arginine, the most destabilizing amino acid, confers 

10 a half -life of about two minutes. (See Table 1 below 
for complete list of amino acids and the correspond- 
ing half-lives) 

3 5 

A similar result is observed when S labelled 
proteins are synthesized in E. coli, isolated, and 

15 added to a mammalian cell lysate, specifically, the 
well characterized rabbit reticulocyte lysate 
system. In such a system, for example the following 
amino - terminal residues can be characterized as 
destabilizing: arginine, lysine, histidine, 

20 phenylalanine, leucine, tryptophan, tyrosine, 

alanine, serine, threonine, aspartic acid, glutamic 
acid, glutamine , cysteine and asparagine. Whether a 
particular amino acid is destabilizing in any 
eukaryotic system can be determined. 

25 Through the course of these studies, it has 

been determined that the tf-end rule has a 
hierarchical structure . Specifically , 
amino- terminal Glu and Asp (and also Cys in 
reticulocytes) are secondary destabilizing residues 
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in that they are destabilizing through their ability 
to be conjugated to primary destabilizing residues 
such as Arg. Amino- terminal Gin and Asn are 
tertiary destabilizing residues in that they are 

05 destabilizing through their ability to be converted, 
via selective deamidation, into secondary 
destabilizing residues Glu and Asp. 

Currently known amino - cerminal residues in 
long- lived , noncompar tmentaliz ed intracellular 

10 proteins from both prokaryotes and eukaryotes belong 
virtually exclusively to the stabilizing class of 
amino acids, exactly as predicted by the N-end rule. 
This result stongly implicates the N-end rule in the 
selective degradation of intracellular proteins in 

15 general . 

The appropriate amino - terminal amino acid 
appears to be an essential (though not necessarily a 
sufficient) requirement for the metabolic stability 
of a noncompartmentalized, intracellular protein. 

20 Thus, in order for a protein to be relatively stable 
intracellular^ , a stabilizing amino acid should be 
present at the am ino - te r minus . The presence of a 
destabilizing residue at the amino - terminus of a 
protein is often, though not always, sufficient for 

25 its metabolic des tab ilizat ion in vivo. When such 

destabilization occurs to a relatively small extent, 
further analysis shows either an insufficient 
accessibility of the amino - terminus or a lack of the 
second determinant: of the complete ami-no --terminal 
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degradation signal. In these Instances, this second 
determinant, which by Itself does not metabolically 
destabilize a protein, must be present in order for 
the half-life of a protein in vivo to be strongly 
05 dependent on the nature of its amino - terminal 
residue. The second determinant of the 
amino -terminal degradation signal was found to be a 
specific internal lysine residue. The ability of 
this critical lysine residue to serve as the second 
10 determinant was shown to be largely independent of 

unique amino acid sequences surrounding the residue. 
Instead, an essential features of the critical 
lysine residue includes its spatial proximity to the 
protein' s amino -terminus . 
15 The presence of a stabilizing amino acid at the 

amino- terminus at least in some cases (for instance, 
as observed for £-gal) will confer stability upon 
the protein. However, a stabilizing amino acid at 
the amino -terminus may not always confer a long 
20 half-life because other degradative pathways may be 
involved in determining the ultimate fate of the 
protein. For example, endopro teoly tic cleavages 
(cleavages outside of terminal regions of the 
protein) may result in exposure of a destabilizing 
25 * amino acid at the amino - terminus of a resulting 
product of the cleavage which is then rapidly 
degraded via the N-end rule pathway. Appropriate 
circumstances for use of a stabilizing amino acid 
can be ascertained empirically. 
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Although Che N-end rule may be only one com- 
ponent (albeit a central one) of a more complex 
"half-life rule" which embraces other aspects of 
selective protein degradation in vivo, the N-end 
05 rule provides a rational, practical approach for de- 
signing or changing protein structure in order to 
pro'duce proteins which are more or less resistant to 
degradation by the N-end rule pathway than natural, 
unmodified protein. Proteins can be designed or 
10 modified at the protein or gene level to provide a 
desired amino acid of either the stabilizing or 
destabilizing class at their amino - terminus . Where 
necessary for destabilization , additional modifica- 
tions can be made to the amino - terminal region to 
15 provide appropriately located lysine residues. The 
ability to regulate the half-life of a protein will 
allow one to modulate the intracellular activity of 
the protein. 

A straightforward approach to modifying a 
20 protein in order to increase or decrease its meta- 
bolic stability or to modulate other properties of 
the protein is to directly engineer the amino- 
terminus of the protein at the protein level. To 
provide a desired amino - terminal amino acid, the 
25 amino- terminus of the protein of interest can be 

chemically altered, for example, by adding an amino 
acid of the stabilizing or destabilizing class to 
the amino- terminus of a protein or polypeptide, era- 
ploying an appropriate chemistry. Thus, for . 
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example, an unstable protein can be rendered more 
stable by adding a stabilizing amino acid residue 
(e.g. methionine serine, alanine, threonine, valine 
glycine or cysteine) to the amino - terminus of the 

05 protein. Conversely, a stable protein can be 

destabilized by adding a destabilizing amino acid to 
the amino - terminus . One. distinct way to modify the 
amino- terminus of a protein would be to employ spe- 
cific enzymes, amino acid-protein ligases, which 

10 catalyze pos ttrans lational addition of a single 

amino acid to the protein's amino - terminus . Other 
methods for nongenetic alterations of the same type 
can readily be ascertained by those skilled in the 
art . 

15 in some proteins, Che amino - terminal end is 

obscured as a result of the protein's conformation 
(i.e., its tertiary or quaternary structure). In 
these cases, more extensive alteration of the 
amino - terminus may be necessary to make the protein 

20 subject to the N-end rule pathway. For example, 

where simple addition or replacement of the single 
amino - terminal residue is insufficient because of an 
inaccessible amino - terminus , several amino acids 
(including lysine, the. site of ubiquitin joining to 

25 substrate proteins) may be added to the original 

amino - terminus to increase the accessibility and/or 
segmental mobility of the engineered amino terminus. 

Modification or design of the amino - terminus of 
a protein can also be accomplished at the genetic 
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level. Conventional techniques of s ite - directed 
mutagenesis for addition or substitution of ap - 
propriate codons to the 5' end of an isolated or 
synthesized gene can be employed to provide a 

05 desired amino - terminal structure for the encoded 

protein. For example, so that the protein expressed 
has the desire-d amino acid at its amino- terminus the 
appropriate codon for a stabilizing amino acid can 
be inserted or built into the amino - terminus of the 

10 protein-encoding sequence. Where necessary, the DNA 
sequence encoding the amino - terminal region of a 
protein can be modified to introduce a lysine 
residue in an appropriate context. This can be 
achieved most conveniently by employing DNA 

15 constructs encoding "universal destabilizing 
segments". A universal destabilizing segment 
comprises a DNA construct which encodes a 
polypeptide structure, preferably segmentally 
mobile, containing one or more lysine residues, the 

20 codons for lysine residues being positioned within 
the construct such that when the construct is 
inserted into the structural gene, the lysine 
residues are sufficiently spatially proximate to the 
amino - terminus of the encoded protein to serve as 

25 the second determinant of the complete 

amino- terminal degradation signal. An example of a 
destabilizing segment is shown in the 
exemplification below (see Figures 7 and 8). The 
insertion of such constructs into the 5' portion of 
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a structural gene would provide the encoded protein 
with a lysine residue (or residues) in an 
appropriate context for des tabilization. 

At the same time, expressed proteins are often 

05 naturally modified within a cell after translation. 
These modifications can include changes at the 
protein's amino - terminus . For example, the amino - 
terminus can be acted on by an aminopept idas e which 
cleaves one or several amino acids from the amino- 

10 terminus. Amino acids, may also be added to the 
amino - terminus by pos t- translational processing. 
This invention provides a way to "by-pass" still 
undefined rules of amino - terminal protein processing 
to expose exactly and specifically the desired amino 

15 acid residues at the amino - terminus of a mature pro- 
cessed protein species. To minimize the impact of 
such pos ttranslational events on the ultimate 
structure of the amino - terminus of a protein of in- 
terest, specific fusion proteins can be designed 

20 wherein the amino -terminus of a protein of interest 
(designed to have the desired stabilizing or de - 
stabilizing structure) is preceded by a "masking" 
protein sequence fused to the amino - terminus . The 
fusion proteins are designed so that the masking. 

25 protein sequence fused to the amino - terminus of the 
protein of interest is susceptible to specific 
cleavage at the junction between the two. Removal 
of the protein sequence thus unmasks the amino- 
terminus of the protein of interest and the half- 



WO 89/09829 



PCT/US89/01468 



-15- 



life of the released protein is thus governed by the 
predesigned amino - terminus . The fusion protein can 
be designed for specific cleavage in vivo, for 
example, by a host cell endoprotease or for specific 

05 cleavage in a in vitro system where it can be 

cleaved after isolation from a producer cell (which 
lacks the capability to cleave the fusion protein). 

Ubiquitin is a broadly useful fusion partner 
for construction of a fused protein with a protein 

10 of interest: the discovery that artificial 

ubiquitin-protein fusions can be cleaved precisely 
by a cytoplasmic eukaryotic protease with little or 
no dependence on the protein to which ubiquitin is 
fused can be applied both in vivo and in vitro in 

15 protein engineering strategies, and is a major 
aspect of this invention. For example, the 
ubiquitin-protein fusion method can be used to 
artificially generate authentic amino - termini in 
proteins produced through artificial means. Thus. 

20 amino-termini characteristic of natural eukaryotic 

or prokaryotic proteins can be generated by in vitro 
cleavage of ubiquitin-protein fusions produced in a 

prokaryotic host. 

A specific methodology for producing ubiquitin- 
25 /3-galactosidase fusion proteins is described in 

detail below. Genes encoding any other proteins can 
be substituted for LacZ (the 0-gal gene) in this 
methodology . 
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In general, ubiquitlti fusion proteins are 
expressed by a chimeric gene construct comprising, 
in 5' to 3' orientation, a ubiquitin gene linked to 
a gene encoding the protein of interest. The codon 
05 for the amino - terminal amino acid of the protein of 
interest is located immediately adjacent the 3' end 
of the ubiquitin gene. The fused gene product is 
cleaved endopro teoly tically either in vivo or in 
vitro (using either pure or partially purified 
10 ubiquitin- specif ic protease identified in the 

present invention) at the junction between ubiquitin 
and the protein of interest to generate the protein 
of interest having the desired amino acid at its 
amino - terminus . 
15 There are a number of specific uses for the 

described ability to specifically engineer the 
protein's amino - terminus . One such use is es- 
tablished by the fact that the intracellular half- 
life of the released protein is governed by the 
20 principles of the N-end rule. Other applications of 
the specific method for engineering the protein's 
amino- terminus described herein range from adjusting 
the desired functional properties of a protein of 
interest, to modulating its antigenicity, and again, 
25 to other uses that can readily be ascertained by 
those skilled in the art. 

This method of generating the desired amino 
acid residue at the amino - terminus of a protein of 
interest involves irwo novel components: one,- the 
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use of ubiquitin-protein fusions, and the other, the 
use of ubiquitin-specif ic processing protease that 
has been identified, and whose striking substrate 
requirements were discovered, in this work.- Al- 

05 though the initial identification of the ubiquitin- 
specific protease has been made in vivo, the enzyme 
is also relatively stable and active in vitro (in 
extracts), and can readily be purified to 
homogeneity by techniques known to those skilled in 

10 the art. Furthermore, the substrate specificity of 
the ubiquitin-specif ic processing protease is highly 
conserved in evolution, being the same in yeast and 
mammals. The enzyme can be purified chromato- 
graphically from a crude extract by sequential 

15 chromatography on pho sphoc e 1 lul o s e , DEAE cellulose, 
and SH-Sepharose among other methods known to those 
skilled in the art. Alternatively, the gene for 
this protease can be cloned by those skilled in the 
art . 

20 The cloned protease gene can be used either in 

vivo, or, alternatively, the gene can be over- 
expressed in a suitable host, the overexpr essed 
ubiquitin-specif ic protease purified and used for 
the same or similar purposes in vitro. The dis- 

25 covery of this enzymatic activity, and detailed 

characterization of its substrate specificity herein 
provides for the in vitro and in vivo use of this 
enzyme . 
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The use of ubiqui tin-pro tein fusions to allow 
the generation of a desired amino acid residue at 
the amino -terminus of a protein of interest can be 
extended to facilitate the purification of such 

05 proteins from producer cells. A gene can be readily- 
constructed that encodes a convenient marker pro- 
tein., such as strep tavidin, linked to a ubiqui tin- 
protein fusion construct described above. The 
resulting (marker protein) -ub iqui tin-pro tein fusion 

10 can be simply isolated from producer cells by using 
the preselected property of the marker protein, for 
instance, the known ability of streptavidin to be 
isolatable by affinity chromatography on a biotin 
column. Thus, purified (marker pro tein) - ub iqui tin- 

15 protein fusion can then be specifically cleaved by 
the ubiquitin- specif ic protease described in this 
invention to generate the final product, a protein 
of interest with the desired amino acid residue at 
its amino- terminus . 

20 The codon for the amino - terminal amino acid of 

the protein of interest can be made to encode the 
desired amino acid by, for example, s i te- direc ted 
mutagenesis techniques currently standard in the 
field- If the gene encoding the protein of interest 

25 is a synthetic gene the appropriate 5' codon can be 
built-in during the synthetic process. Alter- 
natively, nucleotides for a specific codon can be 
added to the . 5 ' end of an isolated or synthesized 
gene by ligation of an appropriate DNA sequence to 
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the 5' (amino- terminus-encoding) end of the gene. 
DNA inserts encoding appropriately located lysine 
residues (such as the "universal destabilizing 
segments" described above) can be inserted into the 

05 5' region to provide for the second determinant of 
the complete amino- terminal degradation. 

Ubiquitin- like fusion partners capable of being 
cleaved by the ubiquitin- spec if ic protease can also 
be used. In addition, fusion partners other than 

10 ubiquitin for masking the amino - terminus of a 
protein of interest can be used. For example, 
functional homologues of ubiquitin from eukaryotes 
or prokaryotes may be used. In appropriate cases, 
the fusion proteins can be designed to contain a 

15 proteolytic cleavage site for a restriction endo- 
protease which has sufficiently narrow specificity 
so that only one target site is cleaved in a fusion 
protein. A crucial property of such a protease must 
be a sufficiently relaxed requirement for the nature 

20 of the amino acid residue(s) abutting the carboxy- 

terminal side of the cleavage site. The target site 
for cleavage is the junction between the fusion 
partner and the amino - terminus of the protein of 
interest and thus the recognition site for the 

25 endoprotease is located to provide for cleavage at 

this location. The commerically available protease, 
complement factor X a> exhibits these properties and 
thus can be used to directly generate proteins with 
predetermined amino acid residues in the ultimate 
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position of their amino- termini (see, K. Nogai and 
H.C. Thoger sen Nature 3 09:810 (1984)). The recog- 
nition site for the endoprotease can be engineered 
into the junction between the masking protein 

05 sequence and the 3' region encoding the amino - 
terminus of the protein of interest. 

A different and distinct method for engineering 
short-lived proteins is provided in this invention 
by the discovery that ubiquitin-pro tein fusions, 

10 such as ubiquitin-Pro-0-galactosidase fusion (Table 
1) , that cannot be efficiently deub iquit inated are 
metabolically unstable. Thus, by attaching the 
amino-terminal ubiquitin moiety to a protein in a 
way that makes its removal either impossible or 

15 inefficient, one can destabilize a protein by a 

distinct technique which is qualitatively different 
from the method of generating the desired amino- 
terminus of a pro tein according to the requirements 
of the N-end rule. Prevention of the efficient 

20 deubiquitination of a ubiquitin-pro tein fusion can 
be achieved in several ways, for instance, by using 
a proline residue at the ubiquitin-pro tein junction 
as shown in Table 1, or by changing the amino acid 
sequence of ubiquitin near its carb oxyl - terminus in 

25 such as way that the ubiquitin moiety is no longer 
recognized, by the ubiquitin- specif ic processing 
protease but can still be recognized by the rest of 
the degradative pathway. These and other ways to 
reduce the rate of deubiquitination of a ubiquitin- 
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protein fusion can be readily ascertained by those 
skilled in the art. 

The methods of this invention can be employed, 
inter alia, for regulating the half-life of a 
05 protein intracellular^ . There are many instances 
where this capability is useful. For example, when 
a gene is introduced into a cell for expression 
therein, the expressed product can be designed for a 
long or short half-life depending upon the particu- 

10 lar need. 

In general, destabilized proteins which have 
short half-lives are more amenable to regulation of 
intracellular levels of the protein. The ability to 
finely regulate the intracellular levels and ac- 

15 tivity of a protein can be useful in therapy or in 
the work with in vitro cell cultures. In gene 
therapy, for example, a gene may be introduced into 
a cell to compensate for a genetic deficiency or 
abnormality. The gene can be inserted under control 

20 of an inducible promoter. Induction results in 
enhanced expression of the gene product and con- 
sequently, higher levels of the product within the 
cell. If the gene is designed to encode an unstable 
protein, the intracellular concentration of the 

25 expressed protein will be more quickly responsive to 
a later reduction in the rate of its synthesis 
because it does not persist within the cell. In 
this way, the intracellular level and/or activity of 
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the protein encoded by the inserted gene can be more 
finely regulated. 

The method of this invention can also be used 
to expand the uses of selectable markers by shorten- 

05 ing the time necessary for a phenotype related to 
the marker to become manifest. Toward this end, a 
product, encoded: by a marlce-r g.ene can be destabilized 
by altering its amino - terminus according to the 
N-end rule. In this way selection for the negative 

10 phenotype can be facilitated because the product of 
the marker gene will be more quickly extinguished 
after the function of the gene encoding the marker 
is abolished. An example is the thymidine kinase 
(tk) gene. The tk gene can be engineered to encode 

15 a less stable enzyme by introducing an appropriate 
destabilizing amino acid at the amino - terminus . 
Gene mutation resulting in tk" phenotype will be 
more quickly manifested by cells because residual tk 
will be. more quickly degraded. This can be 

20 especially useful in slow growing cells where more 
time is required to "dilute out" tk synthesized 
prior* to transformation to the tk type. 

The principles of protein modification based 
upon the N-end rule may also be employed in the 

25 design of cytotoxins. Pro teinaceous cytotoxins can 
be designed as unstable proteins degradable by the 
N-end rule pathway so that they do not persist after 
their toxic action has been exerted on a target 
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cell. Reducing the lifetime of the toxin reduces 
the likelihood of killing nontargeted cells. 

Discovery of the N-end rule pathway of degra- 
dation allows development of mutant cells having 

05 mutations in genes encoding essential components of 
the N-end rule pathway. For example, cells can be 
produced that either permanently or conditionally 
are unable to efficiently degrade otherwise short- 
lived proteins. These cells can be used to produce 

10 desired proteins that ordinarily would be unstable 

within a cell. 

The invention is illustrated further by the 
following detailed description of the elucidation o: 
the N-end rule. 

15 Methods 

S . cerevisiae cells carrying pUB23 (Fig. 1), 
which encodes ub-Het-/3gal (Fig. 3A) , were labeled 
with [ 35 S] methionine, followed by extract pre- 

20 paration, immunoprec ip i tat ion of 0gal and elect- 
rophoresis as described below. The wet polyacryla- 
mide gel was subjected to autoradiography, the band 
of 0gal was excised, and the e lec troe lute d 0gal was 
subjected to six cycles of radiochemical sequencing 

25 by Edman degradation. The sequencing was carried 

out by W. Lane at the MicroChem Facility of Harvard 
University. 
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S ^irS-ft ^ggc tf ed> Mutagenesis 

pUB23 (figure 1) was treated sequentially with 
Acc I, the Klenow fragment of pol I, and Bam HI. A 
fragment containing the Xho I site was purified and 

05 inserted between a filled- in Hind III site and a BAM 
HI site of the M13mp9. phage DNA . (J. Messing and J. 
Vie-tra, Gene 19 , 263 (1982)). Site -direct e-<L muta- 
genesis (M. Smith, Annu. Rev. Ge net . 19, 423 (1985)) 
was carried out as described by Kramer, w*. et al. 

10 Nucl. Acids Res . 12, 9441 (1984) using a synthetic 
25 - residue oligodeoxyr ibonucleo tide , containing ten 
bases on the 5' side and twelve bases on the 3 r side 
of the Met codon of gal. All four bases were 
allowed to occur at the original Met codon positions 

15 during synthesis. Primary phage plaques were 

screened by hybridization. (Wood, N.I. et al.. PNAS * 
82, 1585 (1985)), with the use of a 12-residue 
oligonucleotide probe spanning the region of codon 
changes and hybridizing to the original sequence. 

20 Nonhybridizing plaques containing inserts of the 

expected size were sequenced by the chain termina- 
tion method. (Sanger, F. et al. , PNAS 71 5463 
(1977)). To transfer the desired constructs into 
the pUB23 background, replicative form DNA of mutant 

25 phages was digested with Xho I and Bam HI, and added 
to the same digest of the plasmid pLGSD5 - ATG (see 
Fig. 1 and L . Guarente , Methods Enzymol . , 101 181 
(1983)). The ligated mixture was used to transform 
the coli strain M.G1061. (M.J. Gasadaban and S.N. 
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Cohen, J. Mol. Biol., 138 179 (1980)). Colonies 
containing plasraids of interest (in which the open 
reading frame of £gal had been restored) were 
recognized by their light blue color on X-£gal 
05 plates. 

S . cerevisiae cells of the strain BWG-9a-l (MAT 
his4 ura3 ade6) , transformed (F. Sherman et a 1 . 
Me thods_in_Yeas t_Genetics Cold Spring Harbor Labo- 

10 ratory, N.Y., 1981)) with plasraids of interest were 
grown at 30°C to A 60Q of approximately 5 in a medium 
of 2 percent galactose, 0.67 percent Yeast Nitrogen 
Base without amino acids (DIFCO) , adenine (10 /xg/ml) 
and amino acids including methionine (Sherman, F. et 

15 -al., su£ra) . Typically, cells from a 5 ml culture 
were harvested by filtration through the well of a 
Millipore microtiter filtration plate, washed 
several times on the filter with the same medium 
lacking methionine and resuspended in 0.3 ml of 1 

20 percent galactose, 50 mM potassium phosphate buffer 
(pH 7.4). [ 35 S ]raethionine (50 to 100 fid) was then 
added for 5 minutes at 30°C; the cells were col- 
lected by filtration and resuspended on 0.4 ml of 
the growth medium containing cy c 1 oheximide at 0.5 

25 mg/ml. Samples (0.1 ml) were withdrawn at indicated 
times, and added to 0.75 ml of cold buffer A (see 
below for buffer composition) containing leupeptin, 
pepstatin A, antipain, aprotinin and chymostatin 
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(Sigma), (each at 20 /ig/ml) in addition to 0.4 ml of 

glass beads. Immediately thereafter, the cells were 

disrupted by vortexing for approximately 3 minutes 

at 4°C; the extracts were centrifuged at 12,000g for 

05 3 minutes and the radioactivity of acid- insoluble 
3 5 

S in the supernatants. was. determined. Aliquots of 

the' superna-tants containing equal amounts of" the. 

35 

total acid-insoluble S were processed for immuno- 
precipitation with a monoclonal antibody to 0gal. 

10 Ascitic fluid containing a molar excess of the 
antibody (at least tenfold) was added to each 
aliquot, with subsequent incubation at 4°C for 2 
hours; protein A-Sapharose (Pharmacia) was then 
added, the suspension was incubated with rocking at 

15 4°C for 30 minutes and centrifuged at 12,000g for 1 
minute. The protein A-Sepharose pellets were washed 
three times in buffer A (see below) containing 0.1 
percent sodium dodecyl sulfate (SDS), resuspended in 
an SDS, di thio tr e ito 1 (DTT )- containing electro- 

20 phoretic sample buffer (U.K. Laemmli, Nature 227 6S0 
(1970)), heated at 100°C for 3 minutes, and cen- 
trifuged at 12,000g for 1 minute. Equal aliquots of 
the supernatants were subjected to electrophoresis 
in a 7 percent discontinuous po lyacrylamide - SDS gel 

25 (15 by 15 by 0.15 cm) with subsequent f lourography . 
In some experiments, the above protocol was not 
used, but the extracts were prepared by boiling 
cells directly in the presence of SDS, with es- 
sentially the same results. 
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Plasmid pUB23 (Figs. 1 and 3) was introduced 
into DS410, a rainice 1 1 - p r o due ing E. coli strain. 
(N. Stoker, et aL, in Transcri£tion_and^Trans- 

05 latioTK A - £ractical A££roach B. D. Harnes and S. J. 
Higgins, Eds., IRL press, Oxford, 1984, p. 153). 
Minicells were prepared, and labeled for 60 minutes 
at 36°C with [ 3 5 S ] me thionine ( 600 Ci/mmole, 
Amersham) as described by N. Stoker et al . , s u£ r a . 

10 Labeled minicells were centrifuged, resuspended 

in 2 percent SDS , 10 mM DTT, 10 mM Na-HEPES (ph 7.5) 
and heated at 100°C for 3 minutes. After centri- 
fugation at 12,000g for 1 minute the supernatant was 
diluted 20-fold with buffer A. (1 percent Triton 

15 X-100, 0.15 H NaCl, 5 mM Na-EDTA, 50 mM Na-HEPES , pH 
7.5), followed by the addition of pheny Ime thy 1 - 
sulfonyl fluoride (PMSF) and N-ethylraaleimide to 0.5 
mM and lOmM, respectively. After 4 hours at 4°C , 
the sample was dialyzed against buffer A containing 

20 0.5 mM PMSF overnight at 4°C , and processed for 
immunoprecipi tation (as described above). 

Analy_s is_o f_ubj_££a^ 

S. cerevisiae. cells carrying plasmids of interest 
were grown in 800 ml of a uracil-def icient medium, 
25 then harvested and disrupted with glass beads in 
buffer A containing leupeptin, pepstatin A, anti- 
pain, aprotinin and chymostatin (each at 3 /ig/ral). 
The extract was centrifuged at 12,000g for 3 
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minutes . Saturated ammonium sulfate was added to 
the supernatant to a final concentration of 57 
percent. After overnight incubation at 4°C, the 
precipitated protein was collected by centr if ugation 
05 at 23,000g for 30 minutes. The pellet was redis- 
solved in buffer A containing protease inhibitors. 
After cla.rifica.tion. at 12,000g fo.r 3 minutes, the 
sample was passed through an affinity column which 
had been prepared by crosslinking an IgG fraction 
10 from an ascitic fluid (containing a monoclonal 

antibody to gal to Affi-Gel 10 (Bio-Rad). The IgG 
fraction used for crosslinking had been purified 
from the ascitic fluid by affinity chromatography on 
protein A-Sepharose. After washing with buffer A 
15 lacking Triton X-100, the antibody-bound proteins 
were eluted with 0,25 M glycine-HCl (pH 2.6). The 
eluate was immediately adjusted to pH 7.5 with 1 M 
Na-HEPES (pH 8.5), and thereafter made 0.1 percent 
in SDS. The sample was concentrated by ultrafil- 
20 tration in Centricon 30 (Amicon) , and subjected to 
electrophoresis in a 7 percent discontinuous poly- 
acrylamide-SDS gel (U.K. Laemmli, Nature (London! 
227, 680 (1970)). . Electroblo tting of proteins to 
nitrocellulose, and immunoblot analysis with a 
25 peptide-mediatad antibody to ubiquitin were per- 
formed as described by P.S. Swerdlow, D. Finley and 
A. Varshavsky, Analyt , Biochem. 156, 147 (1986). 
The same results were obtained with a different 
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antibody to ubiquitin obtained from A. Haas (Univ of 

Milwaukee Med. School). 
Cons true tion_of_E^_coli_Ex£res si 
Twenty_Ub :l X^£al_Fusion_Proteins 
05 Four of the pKKUb -X-/3gal vectors (those 

encoding Ub-MET-/?gal , Ub - Gin- /9gal , Ub - Ar g - /3gal , and 
Ub-Pro-0gaL) were constructed as follows. Site- 
directed mutagenesis [M. Smith, Annu^_Rev^_Gene t A 
19, 423 (1985); T. Maniatis, et al^, "Molecular 
10 Cloning", (Cold Spring Harbor Laboratory, N.Y., 
1982); "Current Protocols in Molecular Biology", 
F.M. Ausubel, et al. , (Wiley- Interscience , N.Y., 
1987] was used to insert the sequence GTAC between 
the first and second codons of the ubiquitin reading 
15 frame in the yeast expression vector pUB23 (which 
encodes Ub-Met-^gal) and in its derivatives, (A. 
Bachmair, et al., Science 2 3 4, 179 (1986), encoding 
Ub-Arg-^gal, Ub-Gln-0gal, and Ub - Pro -/3gal . The 
insertion created a Kpn I site positioned such that 
20 when the vector is cut with Kpn I and the ends 

blunted by mung bean nuclease, the second codon of 
the ubiquitin reading frame starts precisely at one 
of the fragment's ends. Thus , diges tion of each of 
the above four vectors with Kpn I and Tth 1111, 
25 followed by treatment with mung bean nuclease, 
yielded four fragments which contained the 
corresponding Ub-X-0gal-coding sequences but lacked 
the first (ATG) codon of the ubiquitin reading 
frame. These fragments were subcloned into an 
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coli expression vector pKK233-2 [E. Amann and J. 
Brosius, Gene 40, 183 (1985)] which had been 
prepared by digesting it with Nco I and filling in 
staggered ends using Klenow fragment of Pol I [M. 

05 Smith, Annu. Rev, Genet. 19., 423 (1985); T. 

Maniatis , et al . , • "Molecular Cloning", (Cold Spring 
Harbor Laboratory, N.Y., 1982); "Current Protocols 
in Molecular Biology" , F.M. Ausubel, et al ._ , 
(Wiley-Interscience , N.Y., 1987)]. This step 

10 yielded the complete Ub-X-£gal sequence (in which 
the ATG codon was supplied by the pKK233-3 vector), 
optimally positioned downstream of the regulatable 
Ptrc promotor of the vector. To construct the 
remaining sixteen pKKUb -X-/3gal expression vectors, 

15 pKKUb-Arg-jSgal was digested with Sail and BaroHT. 
One of the two BaraHI sites in pKKUb - Arg-/3gal is 
located at the junction between the ubiquitin- and 
0gal-coding sequences; the other BamHI site, present 
in the initial pKK233-2 vector [E. Amann and J. 

20 Brosius, Gen e 40, 183 (1985)], was removed in a 

preliminary construction step. The small Sall/BamHI 
fragment (containing the Ptrc promoter, the complete 
ubiquitin-coding sequence, and the Arg codon at the 
Ub-jSgal junction) was subcloned into a M13mp9 vector 

25 [M. Smith, Annu. Rev. Genets 19, 423 (1985); T. 

Maniatis, et al^, "Molecular Cloning", (Cold Spring 
Harbor Laboratory, N.Y. , 1982); "Current Protocols 
in Molecular Biology", F.M. Ausubel, et aL , 
(Wiley-Interscience , , (1987); J. Messing* and J. 
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Vieira, Gene 19, 263 (1982)]. A BstXI/BamHI 
fragment of this construct that contained a portion 
of the ubiquitin-coding sequence and the Arg codon 
at the Ub-/3gal junction, was then exchanged for the 

05 sixteen otherwise identical BstXI/BamHI fragments 
[from the previously made, M13mp9-based constructs 
A... Bachmair et al-,, Science 2 34, 179 (1986)], which 
differed exclusively in a codon at the Ub-/9gal 
junction. The resulting sixteen M13 -mp9 -based 

10 constructs were treated with Sail and BamHI , and the 
small Sall/BamHI fragments containing, the 
ubiquitin-coding sequence and different single 
codons at the Ub-/3gal junction were cloned back into 
pKKUb - Arg - /3gal , replacing the original Sall/BamHI 

15 fragment, and yielding the remaining sixteen 

pKKUb-X-/3gal expression vectors. In all cases, the 
identity of the amino acid encoded at the Ub-/3gal 
junction of a final pKKUb - X -/5c on s true t was verified 
by subcloning into M13 and nucleotide sequencing by 

20 the chain termination method [ M . Smith, Annu^Jev^ 
Genet. 19, 423 (1985); T. Maniatis, et a L , 
"Molecular Cloning", (Cold Spring Harbor Laboratory, 
N.Y., 1982); "Current Protocols in Molecular 
Biology", F.M. Ausubel, et al. , (Wiley - Inter sc ience , 

25 N.Y., (1987)]. 



Purif ication_of_„^ 
E . coli 
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An overnight culture. (1ml) of E. coli JM101 
cells bearing one of the twenty pKKUb -X-0gal 
expression vectors was diluted into 50 ml of Luria 
broth supplemented with, ampicillin at 40 pg/ml, and 

05 the cells were grown with shaking for approximately 
2 hours at 37°C. The cells were harvested by 
centirifugation at 4,..000g for:: 10. minutes, washed 
twice with M9 buffer, and resuspended in 25 ml of M9 
minimal medium supplemented with glucose (0.22%, 

10 w/v) , thiamine (18 fig/ml) , ampicillin (40 pg/ml) , 

0.5mtt isopropylthiogalactoside (IPTG) , and 0.15 ml 

of 10.5% (w/v) Methionine Assay Medium (Difco) . 

After incubation with shaking for one hour at 37* C, 

0.5 to 1.0 MCi of 35 S-Translabel (ICN:~85% [ 35 S] 

3 5 

15 methionine, -15% [ S] cysteine) was added and 
shaking was continued for 5 minutes. Unlabeled 
L-methionine was than .added to lmtt and shaking was 
continued for another 10 minutes. Cells were 
harvested, washed twice with M9 buffer, and 

20 resuspended in 0.5 ml of 25% (w/v) sucrose, 50mM 
Tris-HCl (pH8.0). Thereafter, 0.1 ml of lysozyme 
(lOmg/ml, Sigma) in 0.25 M. Tris-HCl (pH 8.0) was 
added, and the mixture was incubated at 0° C for 
five minutes, followed by the addition of 0.1 ml of 

25 0.5 M Na-EDTA (pH 8 10) and further incubation at 0°C 
for five minutes. The cell suspension was then 
added to a lysis solution (0.8 ml H 2 0 , 50 pi of 1 M 
Tris-Hcl (pH 8.0), 125 pi of 0.5 M Na-EDTA (pH 8.0), 
10 pi of 10% (w/v) Triton X-100) , and gently mixed. 
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The lysate was centrifuged at 40,000g for one hour, 
and Ub-X-/3gal was purified from the supernatant by 
affinity chromatography on an aminopheny 1 thio - 
pyranogalactoside-agarose (APTG-agarose) column as 

05 described (A. Ullman, Gene 2 9, 27 (1984). 

Ubiquitin-X-0gal was eluted from APTG-agarose with 
10 mM 2-mercaptoethanol, 0 . 1 M Na-borate (pH 10.0), 
dialyzed overnight at 4°C against 50% (v/v) 
glycerol, 0.1 raM EDTA, 1 mM di thio thr e i t o 1 (DTT), 40 

10 mM Tris-HCl (pH 7.5), and stored at -20°C in the 
same buffer. Control experiments showed that the 
transient exposure of Ub-X-£gal purified by the 
above proceudre were 0.5 - 1 mg , with enzymatic 
activity of 4 - 6 x 10 units/mg and specific 

15 radioactivity of 1 - 2 X 10 5 cpm/pg. Unlabeled 
Ub-X-0gal was prepared essentially as described 
above except that after two hours of growth in Luria 
broth with ampicillin, IPTG was added to 0.5 raM, and 
the cells were grown for one more hour before 

20 harvesting and lysis. 

2i£iadation_of_TestJroteins 

Washed reticulocytes from pheny Ihydr az ine - 
treated rabbits were purchased from Green Hectares 
25 (Oregon, Wisconsin), and shipped overnight at 0°C. 
The reticulocytes were washed three times with 3-4 
volumes of standard phosphate - buff ered saline (PBS) 
(centrifugations at lOOOg for 10 minutes at 4 9 C). 
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To deplete intracellular ATP [J. Etlinger and A. 
Goldberg, Proc^ Natl, .Acad, Sci. USA, 74, 54 (19 77); 
A. Hershko, et al . ., Proc. Natl, Acad. Sci._USA_77 : 
1783 (1980) r Hershko et al . , j[j.,Biol . _Chem . , 25 8 , 
05 8206 (1982)], the cells were incubated for 90 

minutes at 37° C in Krebs -Ringer phosphate buffer 
containing 0.2. mM 2 , 4 - dinitrophenol and 20 mM 
2 - deoxyglucose , and then washed three times in PBS. 
Pelleted reticulocytes were then lysed at 0°C by 
10 resuspending the pellet in 1.5 volumes of 1 mM DTT. 
After -10 minutes at 0°C, the sample was centrifuges 
at 80,000g for 90 minutes at 4°C. The supernatant 
was removed, divided into aliquots, and stored under 
liquid nitrogen. Only once-frozen aliquots were 
15 used in all experiments. Unless stated otherwise, 
the ATP-depleted reticulocyte extract was used 
directly after thawing, without further processing. 
In some experiments, the thawed extract was at first 
dialysed overnight at 4*C against 1 mM DTT , 10 mM 
20 Tris-HCl (pH 7.5) in dialysis tubing with a m.w. 

cutoff of -3 kD. Fraction II was prepared by DEAE 
chromatography of ATP-depleted reticulocyte extract 
as previously described [D. Fihley and A, 
Varshavsky, Trends Biochem. Sci. 10, 343 (1985); A. 
25 Herschko and A. Ciechanover, Progr. Nucl. Ac. Res. 
Mol . Biol . 33, 19 (1986); S. Pontremoli and E. 
Meloni, Annu. Rev ;ii Biochem, 55, 45 5 (19 86); M. 
Rechsteiner, Aiinu._R.ev, Cell. Biol. 3, 1 (1987); 
J . S . Bond and. P . E , Butler , Annu. Rev . Bi o chem ,. 56 , 



WO 89/09829 



PCT/US89/01468 



-35- 

333 (1987); J. P. Dice, FASEW^l, 349 (1987); J. 
Etlinger and A. Goldberg, Pr oc_ L- Natl J __Acad_ L _Sc J_ 
USA , 74, 54 (1977); A. Hershko, et al if Proc^_Natl^ 
A £ a d^_S c i^_U S A_7 7 : 1783 (1980); Hershko et^al., 

05 B i o l^__Ch enK , 2 5 8, 8206 (1982)] and stored under 

liquid nitrogen. Reaction mixtures for assaying the 
degradation of test proteins in either the total 
reticulocyte extract of Fraction II contained (final 
concentrations) 5% (v/v) glycerol, 1 mM DTT , 5 mM 

10 MgCl 2 , 50 mM Tris-HCI (pH 7.5.), 70% (v/v) 

reticulocyte extract (or Fraction II at 6 mg/ml of 
the total protein), [ 35 S ] Ub -X-/3gal fusion protein at 
20 /ig/ml, and when present, 0.5 mM ATP and an 
ATP-regenerating system (10 mM creating phosphate, 

15 0.1 mg/ml creatine phosphokinas e ) . Reaction 
mixtures were prepared as follows: a mixture 
complete except for ATP and ATP-regenerating system 
was incubated for ten minutes at 37°C; to allow for 
the deubiquitination of a Ub-X-£gal fusion protein 

20 ATP and ATP-regenerating system were then added to 
start the ATP - dependent reactions in the extract and 
the 37°C incubation continued. Control reactions 
with the ATP-depleted extract were performed 
identically except that ATP and ATP-regenerating 

25 system were omitted. The ATP - dep enden t degradation 
of 125 l-labeled bovine serum albumin, hen lysozyme, 
and cytochrome c from S^ce r evis i ae (purchased from 
Sigma, St. Louis, Missouri, and labeled using the 
chloramine T method (A. Ciechanover, et a L , P-roc. 
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Natl. Acad. Sci. USA 77. 13 6 5 (19 80) was assayed as 
described above except that the 10-minute 
preincubation of the test protein at 37 °C in the 
ATP-depleted reticulocyte extract was omitted. To 

05 follow the degradation of test proteins, aliquots 
were taken, from the reaction mixture at the 
indicated times, and eLther- assrayecL f or the amount 
of 5% TCA-soluble radioactivity present, or analyzed 
by SDS-PAGE [U.K. Laemuli, Nature 227: 680 (1970)] 

10 (8% polyacrylamide , 0.05 bisacrylamide , 15x15x0.15 
cm gels), with subsequent . fluoro graphy . 

Detailed Description of the Figures 

Figure 1 shows construction of a ub iquitin- lacZ 
gene fusion. pUB2, a pBR322-based genomic DNA clone 

15 (E. Ozkaynak, et al. Nature 312, 663 (1984) contains 
six repeats of the yeast ubiquitin- coding sequence 
(open boxes) together with the flanking regions 
(jagged lines). pUB2 was modified as shown in the 
diagram by placing a Bam HI site six bases down- 

20 stream from the first ubiquitin repeat. This 
allowed the construction of an in- frame fusion 
(confirmed by nucleotide sequencing) between a 
single ubiquitin repeat and the lacZ gene of the 
expression vector pLGSD5 - ATG (called G2 in L. 

25 Guarente, Metho ds Enzy mol . 101 181 (1983)). The 
term "2/im" denotes a region of the pLGSD-ATG that 
contains -the replication origin and flanking se- 
quences of the yeast plasmid called 2pm circle (See 
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L. Guarente, sujra). Figure 3B shows the amino acid 
sequence of the fusion protein in the vicinity of 
the ubiquitin-/9gal junction. 

Figure 2 shows that the in vivo half-life of 

05 /3gal is a function of its amino - terminal residue, 
(lane a) Minicells isolated from an E. coli strain 
carrying pUB23 , the initial. ub-LacZ fusion. (Figs. 1 
and 3B), were labeled with [ 3 5 S ] me thionine for 60 
minutes at 36°C, with subsequent analysis of 0gal as 

10 described. The same result was obtained when the 
labeled minicell SDS extract was combined with an 
unlabeled yeast SDS extract before immunop r e c ip i t a - 
tion of 0gal. (lane b) S. cerevisiae cells carrying 
pUB23 (Fig. 1), which encodes ub-Met-/3gal (Fig. 3B), 

15 were labeled with [ 3 5 S ] me thi onine for 5 minutes at 
30°C, with subsequent analysis of /3gal . The same 
result was obtained with the lengths of the 
[ 35 S ] methionine labeling periods from 1 to 30 
minutes, and with yeast extracts produced either by 

20 mechanical disruption of cells in the presence of 

protease inhibitors or by boiling the cells directly 
in an SDS - containing buffer. (lane c) Same as lane 
a but with E. coli cells carrying the control 
plasmid pLGSD5 (called Gl in L. Guarente, s u£r a . ) 

25 which encodes £gal. (lanes d to g) S. cerevisiae 

cells carrying pUB23 (Fig. 1), which encodes ub-Met- 
0gal (Fig. 3A) , were labeled with [ 3 5 S ] me thi onine 
for 5 minutes at 30°C (lane d) followed by a chase 
in the presence of cy cloheximide for 10, 30, and 60 
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linates (lanes e to g) , extraction, immunoprecipita- 
tion, and analysis of jSgal. (lanes h to j ) Same as 
lanes d to f, but with ub-Ile-/9gal (see Fig. 3A) . 
(lanes k to m) Same as lanes h to j , but with 

05 ub -Gln-£gal . (lanes n to q) Same as lanes d to g, 
but with ub -Leu-0gal . (lanes r to u) Same as lanes 
d; tro: but: with, ub - Arg -0ga:L. Deslgnatiotis.-: or*i; 
origin of the separating gel; ub , ubiquitin; 0gal, 
an electrophoretic band of the £gal protein con- 

10 taining a specified amino - terminal residue; in this 
terminology, the ttet-£gal portion of ub-Met-0gal is 
designated as 0gal. Arrowheads denote a 
metabolically stable, about 90kD degradation product 
of 0gal which is formed apparently as the result of 

15 an in vivo endopro teolytic cleavage of a proportion 
of short-lived gal proteins such as Leu-/3gal and 
Arg-0gal (lanes n to u) . 

Figure 3 shows the changing amino acid residues 
of gal . at the ub iqui tin-/3gal junction. (A) The 

20 initial plasmid, pUB23 (Fig.l), which encodes 

ub-Ket-^gal, was mutagenized as described above to 
convert the original Met codon ATG at the ub- gal 
junction into codons specifying 19 amino acids other 
than Met. (The original round of mutagenesis shown 

25 in Figure 3, produced 15 out of 19 possible sub- 
stitutions. The remaining four substitutions were 
produced later (see Table 1)). The arrowhead in- 
dicates the site of the deubiquitinating in vivo 
cleavage in the nascent fusion protein that occurs 
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with all of the fusion proteins except ub-Pro-^gal 
(see text). All of the constructions shown encode 
His as the second gal residue. In addition, in 
some of the constructions (ub -Me t -His - Gly -0gal , 
05 ub-Met-Gln-Gly-0gal, and ub-Met-Gln-His -Gly-^gal , 
the last one produced by an insertion mutation, see 
Table 3), either His or Gin were following Met at 
the ubiquitin-/3gal junction, with indistinguishable 
consequences for the metabolic stabilities of the 
10 corresponding /3gal proteins. (B) The amino acid 

sequence (in s ingle - letter abbreviations) of ub-Met- 
/3gal, the initial fusion protein (Fig. 1), in the 
vicinity of the ub-/Jgal junction. S ingle - le tter 
amino acid abbreviations: A, Ala; C, Cys ; D, Asp; E , 
15 Glu; F , Phe; G, Gly; H, His; I, He; K. Lys ; L, Leu; 
M, Met; N, Asn; P, Pro; Q, Gin; R, Arg; S, Ser; T, 
Thr; V, Val ; W, Trp; Y, Tyr . 

Figure 4 shows that ubiquitin-0gal is short- 
lived if not deubiquitinated. (lanes a to g) S. 
20 cerevisiae cells carrying plasmids encoding ub - 
X-/3gal fusion proteins in which X is the residue 
indicated at the top of each lane, were labeled for 
5 minutes at 30°C with [ 3 5 S ] me thionine , followed by 
extraction, iramunoprecipitation and analysis of 
25 /3gal. Fluorographic exposures for these lanes were 
several times longer than those for similar patterns 
in Figure 2 to reveal the multiple ub iqui tination of 
short-lived pga.1 proteins. (lanes h, i) Fluoro- 
graphic overexposure of lanes n, o in Fig. 2 



to 
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reveal the "ladder" of multiply ubiquitinated 

Leu-/Jgal proteins in a pulse-chase experiment (zero 

and 10 minutes chase, respectively), (lane j) Same 

as lanes a to g, but with ub-Pro-0gal. (lane k) 

05 Same as lane j, but with ub - Gln-£gal . (lane 1) Same 

as lane j . (lanes m to p) S~ cerevisiae cells 

carrying- a plasmid. encoding, ub -Pro -/?gal were labe-led 

3 5" 

for 5 minutes at 30 C with [ S]methionine (lane m) 
followed by a chase in the presence of cycloheximide 

10 for 10, 30, and 60 minutes (lanes n to p). The 
upper small arrow to the right of lane p denotes. 
ub-Pro-/3gal , a small proportion of which is still 
present after 1 hour chase. The lower small arrow 
indicates an apparently deubiquitinated Pro-0gal 

15 that slowly accumulates during chase and is meta- 
bolically stable. The dot to the left of lane m 
denotes an endogeneous yeast protein that is pre- 
cipitated in some experiments by the antibody used. 
Square brackets denote the multiply ubiquitinated 

20 0-gal species (see Fig. 5). Other designation are 
as in Figure 2. 

Figure 5 shows the "ladder" 0gal species 
containing ubiquitin. (lane a) S. cerevisiae cells 
carrying a plasmid .which encodes ub-Gln-^gal, were 

25 grown and disrupted, and the extracts processed for 
isolation of /?gal proteins by affinity chromato- 
graphy on a column with immobilized antibody to 
£gal. The £gal proteins thus obtained were electro- 
phoresed in a polyacrylamide-SDS gel, transferred to 
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nitro-cellulose , and probed with an antibody to 
ubiquitin. (lane b) Same as lane a, but with 
ub-Pro-/9gal . (lane c) Same as b but a longer 
autoradiographic exposure. (lane d) S. cerevisiae 

05 cells carrying a plasmid which encodes ub-Leu-^gal 
were labeled with [ 35 S ] methionine for 5 minutes, 
with subsequent extraction, imraunoprecipitatlon and 
electrophoresis of 0gal (the same sample as in 
Figure 4, lane f ) . Square brackets denote the 

10 multiply ub iqui tinated Gln-/3gal species detected 

with antibody to ubiquitin. The arrow indicates the 
band of ub-Pro-/3gal , the initial fusion protein seen 
in lanes b and c. The arrowheads indicate the 
position of the band of deub iqui t ina t e d 0gal (de- 

15 tectable by either Coomassie staining or metabolic 
labeling, but not with antibody to ubiquitin) 
derived from the ub-Gln-/3gal fusion protein. 

Figure 6 shows both prokaryotic and eukaryotic 
long-lived intracellular proteins have stabilizing 

20 amino acid residues at their amino - termini , whereas 
secreted proteins exhibit a complementary bias. 

(A) 208 long-lived, directly sequenced, intra- 
cellular (noncompartmentalized) proteins with 
unblocked amino - termini from both prokaryotes (77 

25 proteins) and eukaryotes (131 proteins) were distri- 
buted into three groups according to the nature of 
their amino - terminal residues as defined by the 
N-end rule (Table 1). All of the long-lived intra- 
cellular proteins examined bear exclusively 
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stabilizing residues at their amino - termini . In 
panels B to D , analogous diagrams are presented for 
243 secreted eukaryotic proteins (B) , for 37 light 
and heavy immunoglobulin chains (C) , and for 94 

05 secreted eukaryotic toxins (D) . Entries in C and D 
are subsets of entries in B . For proteins in B to 
D, th.e- amino -termini compiled correspond., whenever 
the assignment is possible, to the most processed 
form of a protein that is still located within a 

10 secreting cell. The data in A to D were manually 
compiled from the entire set of complete protein 
sequences available before 1981. The same con- 
clusions have been recently reached after a more 
detailed and extensive, computer - ass is ted tabulation 

15 of protein amino - termini using the current National 
Biomedical Research Foundation database. The 
amino - terminal residues of Asn, Cys , His, and Trp 
were excluded from the compilation because in vivo 
half-lives of the corresponding £gal proteins are 

20 still unknown (see, however, the legend to Table 1) . 
Inclusion of the residues (Table 1) into a recently 
compilation of the s*ame type did not change the 
original conclusion. Although the amino - terminal 
Pro was also excluded from the compilation, Pro 

25 appears to be a stabilizing residue for 0gal (Table 
1) , consistent with the frequent presence of Pro at 
the amino- termini of long-lived noncomp ar tmentalized 
proteins. 
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Figure 7 shows the construction of ubiquitin 
fusions with mouse dihydro folate reductase. 

Figure 8 shows a greater range of specific 
dihydrof olate reductase (DHFR)-based protein con- 

05 structs which have in common the DHFR moiety, and 
differ exclusively in the specific amino - terminal 
extensions: attached to DHFR. Structure I is an. 
initial DHFR' construct. Structure II contains the 
40-residue amino - te rminal extension derived from the 

10 amino- terminus of 0gal (see Figure 3). Structures 
III-V are variants of Structure II in which either 
one or both of the lysine residues (denoted as K in 
the single-letter code, see the legend to Figure 3) 
were replaced by the arginine residues (denoted as* 

15 R) Structures V-X are variants of the Structure II 
with increasing deletions in the carboxyl - terminal 
half of the ^gal-derived extension. Structures 
XI-XIII are variants of Structure II with increasing 
deletions in the amino - terminal half of the /3gal- 

20 derived extension. S ingle - letter amino acid desig- 
nations of the amino- termini of Structure I-XIII 
indicate variants of these protein constructs that 
differ exclusively in their amino - terminal residues. 
These variants were obtained through the use of the 

25 ubiquitin-protein fusion approach described herein 
(see Figure 3). Each of the Structures I-XIII was 
constructed at the DNA level using s i t e - dir ec te d 
mutagenesis and other methods of recombinant DNA 
technology currently standard in the field. T.hese 
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DNA constructs were Introduced into the yeast S. 
cerevisiae, and the half- lives (left column of 
Figure 8) of the corresponding proteins 1-XIII were 
directly determined using the methods described 
05 above for /3gal and a monospecific antibody to DHFR. 

R esulr s and Discussion 

Rap_id i _in_yivo deubiquitination of a_nascent ubiji 
quitin- flp ;al fusi on protein 

Branched ubiquitin conjugates in which the 

10 carboxyl- terminal glycine of ubiquitin moieties is 
joined via an isopeptide bond to the a-amino groups 
of internal lysine residues in proteins apparently 
comprise the bulk of ubiquitin conjugates in eu- 
karyotic cells. Joining of ubiquitin to the amino- 

15 terminal a-amino groups of target proteins, to yield 
linear ubiquitin conjugates, may also be chemically 
feasible. See A. Hershko , et al. , PNAS USA 81: 7021 
(1984). Whether or not linear ub-iquitin-protein 
fusions are actually synthesized in vivo through 

20 posttranslational enzymatic conjugation of ubiquitin 
to protein amino - termini , such proteins can also be 
produced by constructing appropriate chimeric genes 
and expressing them in vivo. Construction of one 
such gene, which encodes yeast ubiquitin linked to 

25 /3gal of Escher ichia coli , is shown in Figure 1. 

When this gene is expressed in E. coli, the 
resulting 0gal- containing protein has an apparent 
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molecular mass which is approximately 6 kD greater 

that that of the control £gal , a value consistent 

with the presence of ubiquitin in the protein 

encoded by the chimeric gene. In contrast, when the 

05 same gene is expressed in yeast, the corresponding 

0gal protein is alec trophore t ically indis t inguish - 

able from the control /Jgal. This, result, is in- 

3 5 

dependent of the length of the [ S]methionine 
labeling period (between 1 and 30 minutes). 

10 Furthermore, determination of the amino - terminal 

residue in the putative Met-£gal (half-life, t^ 20 
hours) by Edman degradation of the in vivo - labeled , 
gel-purified 0gal (Figure 2, lane d) directly 
confirmed the presence of the expected Met residue 

15 (Figure 3A and Table 1) at its amino - te rminus . 

Independent evidence that ubiquitin cleavage of the 
fusion protein occurs immediately after the last Gly 
residue of ubiquitin is presented below. We con- 
clude that in yeast, ubiquitin is efficiently 

20 cleaved off the nascent ubiqui t in- 0gal fusion 
protein, yielding a deub iqui tinated 0gal . The 
absence of the deubiqui tina tion reaction in E. coli 
is consistent with other lines of evidence in- 
dicating that prokaryotes lack both the eukaryotic 

25 ubiquitin and ubiqui tin- spec i fic enzymes. At the 

same time, the possibility remains that a functional 
counterpart of ubiquitin exists in bacteria but is 
different in its amino acid sequence from that of 
eukaryotic ubiquitin. The present invention clearly 
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applies not only to the extremely close amino acid 
homologs of ubiquitin such as those found in 
eukaryotes but also to functional homologs of 
ubiquitin such as those that may exist in bacteria. 

05 The ubiquitin-0gal junction encoded by the 

chimeric gene,. Gly-Me.t (Figures 1 anjd 3B) , is 
Ldenrtxcrail to- titer j uujcfcroms- between: adjacent rep.ea.ts. 
in the polyubiquitin precursor protein, which is 
efficiently processed into mature ubiquitin. Thus 

10 it is likely that the same protease, as yet un- 

characterized biochemically, is responsible both for 
the conversion of polyubiquitin into mature ubi- 
quitin and for the deubiquitination of the nascent 
ubiquitin-/3gal protein. If so, one potential way to 

15 inhibit the in vivo deubiquitination of the ubi- 
quitin~/?gal (and thereby to allow analysis of 
metabolic consequences of a stable ubiquitin attach- 
ment to jSgal) would be to convert the Met residue of 
0gal at the ub iqui tin-0gal junction (Figure 3B) into 

20 other amino acid residues (Figure 3A) . The unex- 
pected results of such an approach are described 
below. 

The I n yi vo_half- lif eof _ggal is_a f unc tiqn_of _ its 
amino- terminal residue^ The ATG codon which spec i - 
25 fies the original Met residue of gal at the ubi- 
quitin junction (Figure 3B) was converted by site- 
directed mutagenesis into codons specifying 19 other 
amino acids (See Figure 3A and Table 1). These 
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10 



constructions differ exclusively in the first codon 
of /3gal at the ubiquitin-/3gal junction (Figure 3A) . 
After each of the 16 plasmids thus designed was 
introduced into yeast, analysis of the corresponding 
05 0gal proteins pulse - labeled in vivo led to the 
following results (Figures 2, 4, and Table 1): 

1) With: one- exception (see- below), the ef.fi-- 
cient deubiquitination of the nascent ubiquitin- gal 
occurs irrespective of the nature of the amino acid 
residue of £gal at the ubiquitin-^gal junction. 
Thus, the apparently ubiquitin- specif ic protease 
that cleaves the original ubiquitin-/3gal protein at 
the Gly-Met junction is generally insensitive to the 
nature of the first residue of /3gal at the junction 
(Figure 3A and Table 1). This result, in effect, 
makes it possible to expose different amino acid 
residues at the amino - termini of the otherwise 
identical /Jgal proteins produced in vivo. 

2) The in vivo half-lives of the 0gal proteins 
thus designed vary from more than 20 hours to less 
than 3 minutes, depending on the nature of the amino 
acid residue exposed at the amino - terminus of /3gal 
(Figures 2, 4, and Table 1). Specifically, de - 
ubiquitinated /3gal proteins with either Met, Ser, 

25 Ala, Thr, Val , Cys or Gly at the amino - terminus have 
relatively long in vivo half-lives of 20 hours or 
more (figure 2, lanes d to g, and Table 1), similar 
to the half-life of a control 0gal whose gene had 
not been fused to that of ubiquitin. In striking 



15 



20 
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contrast, the £gal proteins with either Arg, Lys, 
Phe, Leu, Asp or Trp at the amino - terminus have very 
short half-lives, between approximately 2 minutes 
for Arg-0gal and approximately 3 minutes for Lys - 

05 /3gal, Phe-/3gal, Leu-/3gal, Asp-0gal, Asn-^gal and 

Trp-£gal (Figure 2, lanes n to u, and Table 1). The 
trarlf-lif a: of /Tg-al p.ira tains; wrLth. amino - terminal 
residues of either Gin, His or Tyr is approximately 
10 minutes (Figure 2, lanes k to m, and Table 1), 

10 while an amino - terminal lie or Glu confers on £gal a 
half -life of approximately 30 minutes (Figure 2, 
lanes h to j , and Table 1). Both pulse-chase and 
continuous labeling techniques were used in these 
experiments, and yielded similar results. 

15 The set of individual amino acids can be 

ordered with respect to the half-lives that they 
confer on /3gal when exposed at its amino - te rminus . 
The resulting rule (Table 1) is referred to as the 
"N-end rule" . 
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Table_lj_ The _N-end_rule 



In vivo 

Radius of deubiquitination 
Residue X in gyration of nascent 



ub-}C-/3gaL of X(A) ub-X-^g-al 



Met 1.80 + 

Ser 1.08 + 

Ala 0.77 + 

Thr 1.24 + >20 hours 

Val 1-29 + 

Gly 0 + 

Cys + 

He 1.56 + 

Glu 1.77 + - 30 minutes 

Tyr 2.13 + 

Gin 1-75 + "1° minutes 

His 

Phe • 1-90 + 

Leu 1.54 + ~ 3 minutes 

Trp + 

Asp 1.^3 + 

Asn + 

Lys 2.08 + 
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Arg 2.38 + - 2 minutes 



Pro 1.25 -* - 7 minutes 

*The rate of in vivo deub iqui tination of ub-Pro-£gal 
is extremely low. The t-j/2 shown is that of the 
inaLtriarl ub.-Pr o -/3gal fusion protein (see ELg.. 4, 
lanx&s?. j tro p ) 
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Le£end_to_Tab le_l 

The N-end rule. In vivo half-lives of /3gal 
proteins in the yeast S . cerevisiae were determined 
either by the pulse-chase technique (for short-lived 

05 jSgal's; see below) or by measuring the enzymatic 
activity of £gal in crude extracts. For the 
measurements of £gal activity, cells growing in a 
galactose-containing medium were transferred to an 
otherwise identical medium lacking galactose, and 

10 containi.n-g 10 percent glucose. After further growth, 
for at least 5 hours at 30 C, the ratio of 0gal 
activities per cell before and after shift to 
glucose was determined for each of the 0gal pro- , 
teins. [GAL promo ter - driven expression of the fusion 

15 genes (Figs. 1 and 3) is repressed in glucose 
medium]. For shorter - lived 0gal proteins 
(t 9 1 hour), the pulse-chase technique was used 
as^ell (Figs. 2 and 4). Ele c t r ophor e t ic bands of 
£gal proteins labeled with [ S]methionine in pulse- 

20 chase experiments were cut out from scintillant- 

impregnated- dried gels similar to those of Figs. 2 
and 4, and S in the bands was determined. The in 
vivo decay of short-lived £gal proteins deviated 
from first-order kinetics in that the rate of 

25 degradation was lower when measured at later (1 
hour) time points of the chase, the lower rate 
reflecting either a time - dependent toxic effect of 
cycloheximide or intrinsic characteristics of the in 
vivo degradation process. [Arrest of translation is 

30 required for an efficient short-term chase in j. 
c £I±Xisiae because of, the amino acid pool equili- 
bration problems related to the presence of vacuoles 
in this organism]. The half-life values listed 
below were determined for the first 10 minutes of 

35 chase. Several lines of evidence (see description 
of Figs. 4 and 6) suggest that Pro is a stabilizing 
residue. The listed radii of gyration of amino 
acids are from. M. Levitt, J. Mo 1 . Biol. 104:59 < 
(1976) . 
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Deubiquitination of Ub -X-£gal_fu^on_£ rote ins 
in AT P-de pleted reticulocyte extract^ 

Each of twenty 35 S-labelled Ub-X-/3gal proteins 
prepared in E. coli as described above, was added to 
05 an extract prepared from ATP-depleted rabbit 

reticulocytes (Et linger etjl., Proc^Natl^Acad^ 
Sci... USA. 7±v 54 (1977)- Hershko et_aL,. gEg-C^NatL . 
Acad. Set. USA 77: 1783 (1980); Hershko et_aL, 
Biol, Chenu 258: 8206 (1982)), and the fates of the 
10 added proteins were followed by SDS -polyacrylamide 
gel electrophoresis (SDS-PAGE). As had been 
observed in vivo with the same ubiquitin fusions in 
yeast,, an apparently ubiquitin- specif ic protease in 
reticulocyte extract deubiquitinated the added 
15 Ub-X-/9gal fusion proteins to yield the corresponding 
X-/3gal test proteins. The deubiquitination of 19 
out of the 20 Ub-X-0gal proteins in the ATP-depleted 
extract was more than 90% complete in 5 minutes at 
27* (Table 2). The single exception, both in yeast 
20 and in reticulocytes, is Ub-Pro-^gal, which was 

deubiquitinated approximately 20 times more slowly 
than were the other Ub-X-0gal proteins. 

Amino acid sequencing (by Edman degradation) of 
deubiquitinated -j3gal proteins reisolated from either 
25 the reticulocyte extract or yeast cells showed that, 
in every case tested, the proteolytic cleavage 
occured precisely at the Ub-^gal junction (Table 2). 
Although sequencing revealed that the amino termini 
of some X-/3gal proteins underwent specific 
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modificiations (see Table 2), in no case did these 
modifications involve proteolytic cleavages beyond 
the amino -terminal residue X. 

All of the deubiquitinated X-/igal proteins were 

05 metabolically stable in the ATP-depleted 

reticulocyte extract as judged from SDS-PAGE 
analysis and from- the. negligible- production. o£ 
acid-soluble radioactivity in the extract. Thus, 
preincubation of Ub-X-/Jgal fusion proteins in the 

10 ATP-depleted reticulocyte extract makes it possible 
to generate twenty X-/3gal test proteins which differ 
exclusively at the amino - terminal residue X. 
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Ha lf-life of a ggal protein in ATP - supplemented 
re ti culocyte extract is a function of the ggalVs 
amino- terminal re si due. 

While all of the twenty X-/3gal proteins were 

05 metabo.lically stable in the A/TP - depleted 
ratica'locjte extract., mos.tr- of them became 
short-lived upon addition of ATP to the extract 
(Table 2). We refer to an amino - terminal residue as 
stabilizing if the corresponding X-0gal is 

10 relatively long-lived in the ATP - supplemented 

extract (less than 10% degradation in 2 hours at 
37°C) r and as destabilizing if the degradation of 
the corresponding X-0gal in the extract exceeds 15% 
under the same conditions (Table 2). 

15 The time courses of degradation for several 

X-0gal proteins showed reproducible initial lags. 
However, semilogarithmic plots of the time courses 
showed that, after the initial lags, the degradation 
of X-0gal in the ATP - supplemented reticulocyte 

20 extract obeyed first-order kinetics for at least the 
first* two hours, making it possible to compare the 
degradation of different X-/3gal proteins by 
comparing their -half - lives In the extract (Table 2). 
The range of 0gal half-lives in the 

25 reticulocyte extract encompasses more than two 

orders of magnitude, from approximately 50 minutes 
for Gln-0gal to approximately 100 hours for Val-j0gal 
(Table 2). The half-lives of X-j9gal proteins- 
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bearing stabilizing amino - terminal residues range 
from approximately 20 hours for lle-0gal to 
approximately 100 hours for Val-j8gal. Half-lives of 
the metabolically unstable X-£gal proteins in the 

05 reticulocye extract were comparable to the 
half-lives of other proteolytic substrates 
(iodi.na.ted serum: albumin, lysoziym.e, and cytochrome 
c) in the same extract. These latter test proteins 
have been used in earlier studies of 

10 ubiquitin-dependent protein degradation in 

reticulocye extract (Finley e t _a 1 ± , Trends Jiochem^ 
Sci^ 10: 343 (1985); Etlinger et_aL, Proc^_NatL 
Acad^Sci^JJSA 74: 54 (1977)). Recently, at least 
some of these proteins have been shown to be 

15 targeted for degradation via their destabilizing 
amino -terminal residues [Reiss e t_a L , J^Biol^ 
ChenK 263: 2693 (1988)] as defined by the N-end 
rule . 
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rhe N-end rule in yeast and in mammalian reticulocytes. 



Residue X in 
Ub-X-Ggal 


Half-life o 


f X-Ogal 


Amino terminus of reisolated X-Bgal 
as determined by protein sequencing 


Yeast 
( S. cerevtsiae ) 
In vivo 


Mammalian 
reticulocytes 
In vitro 


Yeast 
In vivo 


Reticulocytes 
/n vitro 


Val 


>20 hours 


100 hours 




Val-Bgal ' 


Met 


>20 hours 


ou nuurs 


g 

ivit5L*tjya.l 


d e 

Met-Ggai 


G!v 


■sOn hnnre 

■*c\i iiuuis 

'tu llUUlo 


30 hours 




GIy-Ggai d,e 


Pro 


>*iu nours 


i 


i 


Ala 


>20 hours 


4.4 hours 


Ala-Bgal b 


Ala-flgal dlf 


Ser 

WCI 


nours 


1.9 hours 


h 


Ser-Bgal 


Thr 


>20 hours 


7.2 hours 


Thr-Ggal b 


Thr-flgal d,f 


Cys 


^ OH hm ire 


1.2 hours 




[?] - Ggal 


lie 


30 minutes 


20 hours 


b.c 

ue-ugai 


Ile-Ggal d#9 


GIu 


30 minufpQ 


1-U nours 


Arg-uiu-BgaJ 


J Glu-Bgal + Arg-Glu-Bgal 
L Arg-Glu-Bgal f 


His 


iu rninuies 


3.5 hours 




His-flgal d 


Tvr 

lyr 


10 minutes 


2.8 hours 


Tyr-fjgal b ' c 


Tyr-flgal d 


Gin 


10 mimttoe 
IU lilliiuicS 


0.8 hours 


Pl-Qta-Bgal 1 


f [?]-G!u-flgal + GIu-Bgal d,k 
[ Arg-GIu-Ggal f 


Asp 


3 minutes 


1.1 hours 


Arg-Asp-Bgal c 


f Asp-Bgal + Arg-Asp-flgal d 
[ Arg-Asp-Bgai f 


Asn 


3 minutes * 


1.4 hours 


Arg-Asp-Ggal c 


f Asn-Bgal + Asp-Bgal d 
[ Asn-Bgal + Arg-Asp-Bgal' 


Phe 


3 minutes 


1.1 hours 




Phe-Bgal d 


Leu 


3 minutes 


5.5 hours 




Leu-figal d 


Trp 


3 minutes 


\2^ hours 




Trp-Bgal d 


Lys 


3 minutes 


1.3 hours 




Lys-flgal d 


Arg 


2 minutes 


1.0 hours 




Arg-flgal d 
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Amino^terminal^location^ 
essentiaj,_f or_it^^ 
i.n_;y_e as t 

Site-directed mutagenesis was employed to 

05 insert a codon specifying a "stabilizing" amino acid 
(in this experiment, the Met residue) before the 
first codon. of 0gal at the ub iquLtin -jSga 1 junction 
(Table 3). Insertion of a stabilizing residue (Met) 
before either another stabilizing residue (Thr) or a 

10 variety of destabilizing residues (Gin, Lys , and 
Arg) at the ub iqui t in- jSgal junction invariably 
results in a long-lived deub iqui t inated figal (Table 
3). Furthermore, in contrast to ubiqui tin- Pro -0gal 
which is not only short-lived but also resistant to 

15 deubiquitination (Figure 4, lanes j to p, and Table 
1), ubiquitin-Met-Pro-/3gal is efficiently deubi- 
quitinated in vivo to yield a long-lived Met-Pro- 
0gal (Table 3). These results show that both the 
identity of amino acid residue and its amino- 

20 terminal location (presumably the presence of a free 
a-amino group) are essential for its effect on 0gal 
half-life. In addition, these results (Table 3) 
further support the expectation that ubiquitin- 
specific cleavage of the fusion protein occurs 

25 immediately after the last Gly residue of ubiquitin 
( Figure 3A) . 
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Tabled N-tenainal location of an amino acid is 
essential for its effect on /3gal half-life 



Fusion protein 


!j uEuu i qui l ina tea 
fusion protein 


T 

ud - inr - 0gal 
ub - Met - Thr - flgal 


>20 hours 
>20 hours 


T 

ub - GTn - flnal 
ub I Met - Gin - 0gal 


= 10 minutes 
>20 hours 


T 

ub - Lys -tfgal • 
ub - Met - Lys - 5gal 


-3 minutes 
>20 hours 


T 

ub - Arg - tfgal 
ub - Met - Arg - flgal 


= 2 minutes 
>20 hours 


j ub - Pro - 0ga1 
ub - Met - Pro - 0gal 


*• 

= 7 minutes 
>20 hours b 



Amino - terminal location of an amino acid is essen- 
tial for its effect on /3gal half-life. The inser- 
tion mutants were obtained essentially as described 
for the initial set of mutants except that a 32- 
residue oligonucleotide, 5 ' - CCCGGGATCCGTGC ( G/C/ 
T/) (G/T) CATACCACCTCTTAG was used, containing 14 
bases on the 5' side and 15 bases on the 3' side of 
the ambiguous codon inserted behind the Met codon. 
Bases in parentheses denote ambiguities at the 
positions 16 and 17 in the sequence. Half-lives of 
the corresponding ./Jgal proteins were determined as 
described in the legend to Table 1. 
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The electrophoretic patterns of short-lived 
(but not of long-lived) 0gal proteins invariably 

05 contain a specific, about 90 kD cleavage product of 
£gal (Figure 2, lanes n to u) which, unlike the 
parental jflgal species, accumulates, during the 
postlabeling (chase) period (Figure 4, lanes m-p) . 
The 90 KD £gal fragment constitutes a relatively 

10 small proportion of the initial amount of the 
pulse-labeled £gal. Nonetheless, its existence 
implies that an in vivo endopro teoly t ic cleavage can 
rescue a protein fragment from the metabolic fate of 
its short-lived parental protein. It remains to be 

15 seen whether the resulting possibility of multiple 
half-lives within a single protein species is 
exploited in the design of naturally short-lived 
pro te ins . 

Ubi£uitin^££al_is_sho~— 

20 tinated^ 

Ubiquitin-Pro -0gal , the only ub iqui t in- £gal 
fusion that is not deubiqui tinated in vivo (Fig. 4, 
lanes j to p), has a half-life of approximately 7 
minutes (Table 1) which is less than 1 percent of 

25 the half-life of me tabo lical ly stable 0gal proteins 
(Table 1). One interpretation of this result is 
that a me tabolically stable ubiquitin attachment to 
protein amino - termini is sufficient to signal . 
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degradation of acceptor proteins. This inter- 
pretation is consistent with earlier biochemical and 
genetic evidence that ubiquitination of short-lived 
proteins in a mammalian cell is essential for their 

05 degradation. At the same time, all ubiquitin- /3gal 
fusion proteins other than ubiquitin- Pro -/?gal are 
rapidly deub iquLtinated. in vivo (Table 1). Thus., 
the posttranslational amino-termal ubiquitination of 
proteins may not be involved in an initial recogni- 

10 tion or commitment step that designates proteins for 
degradation in vivo. Whether posttranslational 
amino- terminal ubiquitination (if it actually occurs 
in vivo) . is essential for later stages of the 
degradation pathway remains to be determined. 

15 Earlier in vitro experiments indicated that prefer- 
ential chemical modification of amino - termini of 
proteolytic substrates inhibits their degradation in 
an in vitro ub iqui t in- dep endent proteolytic system. 
Based on these data, it was proposed that amino-ter- 

20 minal ubiquitination of proteins is essential for 

their degradation. An alternative interpretation of 
the same results is that chemical blocking of 
proteins' amino - termini prevents the recognition of 
their amino- terminal residues by the "N-end rule" 

25 pathway whose initial stages are not necessarily 
ubiquitin- dependent . 
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Short^lived^ga^ 
in_v i vo . 

Overexposures of the pulse-chase fluorograms 
(Fig. 2) reveal that the major band of a deubi- 

05 quitinated, short-lived figal protein coexists with a 
"ladder" of larger molecular mass, £gal - containing 
bands irregulaxly spaced at 4 to 7 kD intervals. 
(Fig. 4, lanes c to g) . No such larger species 
appear when the fluorograms of long-lived £gal 

10 proteins are similarly overexposed (Fig. 4, lanes a 
and b).. Immunological analysis with both antibodies 
to £gal and antibodies to ubiquitin demonstrates 
that the "ladder" figal species contain ubiquitin 
(Fig. 5). 

1 5 A_m o de 1_ f o r _ t h e _ s e 1 e c t i vejegradationjathway . 

With the exception of natural or engineered 
ubiquitin fusion proteins (Fig. 1 and Table 1), 
nascent proteins apparently lack ubiquitin moieties. 
The in vivo amino - te rminal processing of nascent 

20 noncompartmentalized proteins generates their mature 
amino- termini via the action of amino - te rm inal 
peptidases whose substrate specificities have been 
partially characterized. (See Tsunasawa, S. et a 1 . 
j . Biol. Chem. 260 5382 (1985); fioissel, J. P. et a 1 . 

25 PNAS USA 82, 8448 (1985)). We suggest that the > 
amino- termini thus generated are recognized by an 
"N-end-reading" enzyme. One specific model is that 
a commitment to degrade a protein molecule is- made 
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as a result of the recognition of its amino - terminal 
residue by a stochastically operating enzyme whose 
probability of "clamping" at the target's amino- 
terminus is determined by the N-end rule (Table 1). 

05 Once the commitment is made, it is followed by a 
highly processive ub iquitination of the target 
protein, which, in the- ease of 0gal is- conjugated to 
more than 15 ubiquitin moieties per molecule of 0gal 
(Fig. 4 r lanes c to g, and Fig. 5.). The multiply 

10 ubiquitinated target protein is then degraded by a 
"down stream" enzyme (1) for which the ubiquitin 
moieties of the target serve as either recognition 
signals or denaturation (unfolding) devices, or 
both. 

15 The ubiquitin-containing "ladder" 0gal species 

(Fig. 4, lanes c to . 1 , and Fig. 5) consist of 
apparently branched ubiquitin moeities joined to the 
a-amino groups of internal lysine residues in /3gal. 
Surprisingly, the "ladder" 0gal species derived from 

20 ubiquitin-Pro -jSgal are electrophoretically indistin- 
guishable from the analogous species of £gal whose 
amino -terminal ubiquitin is cleaved off the nascent 
fusion protein (Fig. 4, lanes j to 1, and Fig. 5). 
If the electrophoretically indistinguishable ubiqui- 

25 tinated £gal species are indeed structurally homo- 
logous, these results would be compatible with two 
alternative models in which, immediately after the 
first ubiquitins are branch- conj ugated to jSgal, 
either a branch-ubiquitinated ubiquitin-Pro-/?gal 
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undergoes amino - terminal deubiqui tinat ion or, 
alternatively, an analogous £gal species lacking the 
amino- terminal ubiquitin moiety reacquires it. 
Experimental resolution of this ambiguity may 

05 establish whether the pos ttr ans lat ional amino- 

terminal ub iqui tinat ion of proteins (if it occurs in 
vivo) plays a role in selective protein turnover. 

Although both prokaryotic and eukaryotic 
proteins appear to follow the N-end rule (see 

10 below) , bacteria apparently lack the ubiquitin 

system. Thus it is possible that th.e hypothetical 
N-end-recognizing protein is more strongly conserved 
between prokaryotes and eukaryotes than is the rest 
of the selective degradation pathway. Interest- 

15 ingly, the properties of a mammalian protein E3 
whose presence is required for ubiqui tination of 
proteolytic substrates by ub iqui tin- conj ugating 
enzymes in vitro are consistent with it being a 
component of the N-end-recognizing protein. 

2 OTh e N^end_rule_and_the_known 

ill t£a c£iiHl*I_£I££— — ■ 

The unblocked amino- terminal residues in 

metabolically stable, noncompar tmentalized proteins 

from both prokaryotes and eukaryotes are exclusively 

25 (Fig. 6A) of the stabilizing class (Met, Ser, Ala, 

Gly, Thr, Val), that is, the class that confers long 

in vivo half-lives on /3gal (Table 1). The one 

short-lived intracellular protein for which the. 
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mature amino - terminus is known is the ell pro tein of 
phage lambda, the central component of a trigger 
that determines whether lambda grows lytically or 
lysogenizes an infected cell. (Y.S. Ho, D. Wulff, 

05 M . Rosenberg, in Regulation of Gene Exp ressio n , I . 
Booth and C. Higgins , Eds.. (Cambridge Univ. Press, 
London, 1986), p. 79; F. Banuett, It. A.. Hoyt, 
McFarlane, H . Echols , I. Herskowitz , J. Kol. Biol... 
187 , 213 (1986); M.A. Hoyt, D.M. Knight, A. Das, 

10 H.I. Miller, H. Echols, Cell 31, 565 (1982); K. 
Nasmyth, Nature (london) 3 20 , 670 (1983)). The 
half-life of ell in lambda- infected E. coli is less 
than 3 minutes. Strikingly, the mature amino - 
terminus of. ell starts with Arg (Ho, Y.W\ et al. , J. 

15 Biol . Chem. 257, 9128 (1982)), the most destablizing 
residue in the N-end rule (Table 1). 

While the destabilizing amino acids can be 
either hydrophobic, uncharged hydrophilic or charg- 
ed, they share the property of having larger radii 

20 of gyration than any of the stabilizing amino acids 
except Met (Table 1) . 

A mi no - termlnal_r es idues .i^^conxp.artmenta lized pro - 
teins are largely of the^des tab 11 izing class . 

Figure 6 illustrates a striking difference 
25 between the choice of amino - terminal residues in 
long-lived, noncompartmentalized intracellular 
proteins (A) and in compartmentalized proteins, such 
as secreted proteins (B) t many f which are als 
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05 



10 



long-lived in their respective extracellular com- 
partments. One implication of this finding is that 
a single intracellular degradation pathway operating 
according to the N-end rule could be responsible 
both for the diversity of in vivo half-lives of 
intracellular proteins and for the selective de- 
struction of compartmentalized proteins that are 
aberrantly introduced into the intracellular space. 
Some miscompartmentalized proteins may be more 
harmful to the cell than others. It is therefore of 
interest that secreted eukaryotic toxins contain 
strongly destabilizing residues (Arg, Lys . Leu, Phe . 
Asp) at their amino - termini more often than the 
general population of secreted proteins (Fig. 6, 

15 panels B to D) . 

The above consideration also suggest that, if 
the topological outside of a cell, such as lumens of 
the endoplasmic reticulum and golgi. and the extra- 
cellular space, were to have degradation pathways 
analogous to the N-end rule pathway, they could be 
based on "inverted" versions of the N-end rule in 
which the amino-terminal residues that are de- 
stabilizing inside the cell are now the stabilizing 
ones and vice versa. Thus, the methods of the 
present invention should also be useful for manipu- 
lating the metabolic stability and other properties 
of compartmentalized proteins, including secreted 



20 



25 



ones . 
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Possible role of the N-end rule path way in t he 
turnover of lang - 1 ived ^r o te ins . 

Long-lived intracellular proteins with destabilizing 
(Table 1) - penultimate residues generally retain 

05 their initial amino -terminal me thionine .res idue . 

The amino - terminal residues in long-lived intracel- 
lular proteins 5 that do undergo amino - terminal 
processing are . invariably of the stabilizing class 
(Table 1) . An interesting possibility that would 

10 involve the N-end rule pathway in the turnover of 
long-lived proteins is that the rate-limiting step 
in the in vivo degradation of long-lived proteins 
may be a slow aminopep tidas e cleavage that exposes a 
destabilizing residue, followed by rag id degradation 

15 via the N-end rule pathway. Note that fine-tuning 
of the rate of degradation may in this case be a 
function of the rate of aminopep tidas e cleavage 
exposing a destabilizing residue rather than a 
function of the residue's destabilizing capacity 

20 according to the N-end rule. 

Ifrg .N^end rule and selective degradation of short- 
lived and damaged proteins . 

The recognition of polypeptide chain folding 
patterns or of local chemical features that target 
25 an otherwise long-lived but damaged protein for 
selective degradation in vivo is unlikely to be 
mediated directly by the N-end rule pathway. 
Instead, we suggest that specific proteases 
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(analogous in function to nucleases that recognize 
specific lesions in DNA) cleave a targeted protein 
so as to expose a destabilizing residue at the 
amino- terminus of one of the two products of a cut. 
05 One testable prediction of this model is that the 

initial cleavage products of the degradation pathway 
should bear destabilizing residues at their N-ter- 
mini. The preferential exposure of destabilizing 
residues at the amino - termini of products of the 
10 initial protein cleavages may be due either to 

intrinsic specificities of the proteases involved or 
simply to the fact that a majority of the amino 
acids belong to the destabilizing class (Table 1). 
Furthermore, initial cleavages of a protein would be 
15 expected to destabilize aspects of its original 
conformation, thus increasing the probability of 
further internal cuts. Whether the initial cleavage 
products of a protein would be degraded exclusively 
via the N-end rule pathway or would have to be 
20 processed further by additional internal cleavages 
should depend on several factors, such as the 
exposure of destabilizing residues at the amino- 
termini of initial cleavage products, and the 
relative rates of introduction of internal cuts. In 
25 this model, the N-end rule pathway should be es- 
sential for degradation of most of the me tabolically 
unstable proteins, from chemically damaged, pre- 
maturely terminated, improperly folded and miscom- 
partmetitalized ones to those that cannot assemble 
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into native multisubunit aggregates, and finally to 
otherwise normal proteins that are short-lived in 
vivo. Thus, the metabolic instability of a protein 
may be mediated not only by the exposure of a 

05 des tabilizing residue at its amino - terminus , but 
also by local conformational and chemical features 
of its: polypep tide chain that result in proteolytic 
cleavages exposing destabilizing residues at the 
amino- termini of cleavage products. 

10 For any given protein, a variety of factors in 

addition to the N-end rule may combine to modulate 
its half- life in vivo. Among such factors may be 
the flexibility and accessibility of the protein's 
amino- terminus (Thornton, J.tf. and Sibanda, B.L., J. 

15 Mol . Bio. 16 7 443 (1983)), the presence of chemical- 
ly blocking amino - terminal groups such as the acetyl 
group, the distribution of ub iquitinatab le lysine 
residues near the amino - terminus r and other variabl- 
es, such as the structure of the carboxy - terminus . 

20 Since amino - terminal regions of multisubunit pro- 
teins are commonly involved in the interfaces 
between subunits (Thornton, J.M. and Sibanda, B.L., 
J . Mol. Bi£. 16.7 443 (1983)), quarternary structure 
of proteins is yet another parameter that is ex- 

25 pected to modulate the impact of the N-end rule 

pathway on protein half -lives in vivo. Finally as 
suggested above, the N-end rule pathway may also be 
essential for the degradation of proteins whose 
initial recognition as targets for degradation is 
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independent of the structures at their amino- 
termini . 

Functional_si£nif icance 
£ion_of_amino - acids_to_ami 

05 It has been known for many years that in both 
bacteria and eukaryotes there exists an unusual 
class of enzymes, aminoacyl- transfer RNA-protein 
transferases, which catalyze pos ttrans la tional 
conjugation of specific amino acids to the mature 

10 amino-termini of acceptor proteins in vitro (R.L. 

Soffer, in Trans f e r_RN Aj_B i o 1 o fi i «l_As £e c t s f D. Soil, 
J.N. Abelson, P.R. Schimmel, Eds. (Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY 1980), 
p493;C. Deutch, Me tho ds_Enz^mo 1 . 10 6, 198(1984): A. 

15 Kaji, H. Kaji, G.D. Novell! , J iol^Ch e m . 240, 
1185 (1965)). The pos ttranslational addition of 
amino acids to proteins in vivo dramatically ac- 
celerates in a stressed or regenerating tissue, for 
example, after physical injury to axons of nerve 

20 cells (S. Shyne-Athwal, R.V. Riccio, G. Chakraborty, 
N.A. Ingolia, Science 231, 603 (1986); N.A. Ingolia 
fit al., J^Neurosci 3, 2463 (1983)). The N-end rule 
provides an explanation for this phenomenon. We 
suggest that selective changes in metabolic sta- 

25 bility of otherwise undamaged, longlived proteins 
that may be required by a changed physiological 
state of the cell are brought about by posttransla- 
tional addition of destabilizing amino acids to the 
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amino -termini of target proteins in vivo. Striking- 
ly, the known reactions of pos ttrans lational addi- 
tion of amino acids to proteins (R.L. Soffer, in 
Transfer RNA:_Biolo g;ical Aspects , D . Soil, J.N. 

05 Abelson, P.R. Schimmel, Eds. (Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY 1980)., p493;C. 
Deutch, Methods Snzymol . 10 6 , 198 (1984): A. Ka j i , H.. 
Kaji, G.D, Novelli, J. Biol. Chem. 240, 1185 (1965); 
S. Shyne-Athwal , R.V. Riccio, G . Chakraborty, N . A. 

10 Ingolia, Science 231, 603 (1986); N.A. Ingoliaet 
al., J. Neurosci 3, 2463 (1983)) involve largely 
those amino acids (Arg, Lys , Leu, Phe , and Tyr) that 
are destabilizing according to the N-end rule (Table 
1). Physiological states in which addition of 

15 destabilizing amino acids to proteins could be 

expected to occur include entry to and exit from the 
cell cycle, responses to. chemical or physical 
stress, and specific differentiation events, such as 
erythroid differentiation and spermatogenesis, in 

20 which a proportion of preexisting, otherwise long- 
lived intracellular proteins is selectively de- 
graded. 

The in vitro degradation of some proteolytic 
substrates in a 'ubiquitin-dependent system from 
25 mammalian reticulocytes has recently been shown to 
depend on the presence of certain aminoacy 1 - tRNAs 
(Ferber, S. and Ciechanover, A.., J. Biol . Chem. 261 
3128 (1986)). We suggest that this phenomenon also 
reflects a requirement for pos ttranslational 
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addition of specific destabilizing amino acids to 
the amino- termini of proteolytic substrates. The 
initial proteolytic substrates in question have 
amino- terminal residues of Asp or Glu, both of which 

05 are destabilizing according to the N-end rule 

(Table 1). This raises an interesting and testable 
possibility that certain amino - terminal residues in 
proteins may not be directly destabilizing as such 
but only through their ability to be conjugated to 

10 other destabilizing residues. 

Ubi£uitin_fusions_wit^ 

In a set of Ub fusions constructed with mouse 
dihydrofolate reductase (DHFR) , a monomeric -20-kd 
protein whose structure is known at atomic resolu- 

15 tion, the mature amino - te rminus of the "natural" 

DHFR is extended by 7 residues due to a construction 
route taken (Figure 7). After cleavage of Ub from 
the nascent ubiquitin-DHFR fusion proteins in vivo, 
the deubiquitinated DHFR proteins differ exclusively 

20 at their amino - terminal residues. These construc- 
tions are analogous to the set of jSgal test proteins 
(Figure 3). As expected, the DHFR proteins bearing 
those amino- terminal residues that are stabilizing 
according to the N-end rule (Table 1) are long-lived 

25 in yeast (Figure 7 and data not shown) . Although 
the presence of a residue that is destabilizing 
according to the N-end rule at the amino - termini of 
an otherwise identical DHFR protei-n does destabilize 
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it in vivo, the extent of destabilization is small 
(Figure 7A) in comparison to the results with 0gal 
of analogous design (Table 1). To address the 
mechanistic significance of these findings, a 

05 40-residue amino - terminal region of 0gal was 

positioned upstream, of the original DHF.R's amino - 
terminus (Figure 7) . The DHFR proteins" bearing a 
destabilizing residue followed by the 0gal - derived 
extension are approximately as short-lived in vivo 

10 as their unstable jSgal counterparts , in striking 
contrast to the otherwise identical DHFR proteins 
chat lack the 0gal - specif ic amino - terminal extension 
(Figure 7B and data not shown; cf. Figure 7A) . 
Furthermore, the extension-bearing DHFR proteins 

15 that have s tabilizi ng residues at their amino - 

termini are long-lived in vivo (Figure 7B) . This 
latter result proves that the 0gal - specif ic ex- 
tension as such, in the absence of a destabilizing 
amino - terminal residue, does not confer a short 

20 half -life on DHFR. These findings also indicate 

that the reason for the striking difference between 
half -lives of the DHFR's that either lack or contain 
the ^gal-specific extension (and bear identical, 
destabilizing amino - terminal residues) is due to 

25 differences in amino -terminal targeting elements in 
these proteins and not to differences between the 
overall structures of DHFR and £gal. 
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When DHFR is fitted with a 26-residue, ^gal- 
derived amino- terminal extension instead of the 
original 40-residue extension, the dependence of the 
in vivo half-life of the resulting protein on the 

05 nature of its amino - terminal residue is intermediate 
between that of the original DHFR and that of the 
DHFR bearing a 36-residue 0gal- derived extension 
(Figure 7C; cf. Figure 7B) . Thus, the sequences 
required for the effect of the original 0gal- 

10 specific extension are not confined to a short 
stretch within the extension but are distributed 
over the length of the extension. These insights 
indicate that the complete amino - terminal degrada- 
tion signal contains a distinct determinant addi^ 

15 tional to the determinant represented by the amino- 
terminal amino acid residue. To address the nature 
of the second determinant in greater detail, a 
number of otherwise identical DHFR-based proteins 
bearing different variants of the ^gal-derived 

20 extension and either a stabilizing or a de- 
stabilizing amino-terminal residue were expressed in 
the yeast S. cerevisiae and their half-lives 
determined (Figure 8). The first conclusion from 
the data shown in Figure 8 is that the two lysine 

25 (K) residues present in the jSgal extension, although 
by themselves they do not render the protein 
metabolically unstable, are absolutely essential for 
conferring sensitivity to the N-end rule upon the 
rest protein. Indeed, while the conversion o"f just 
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one of the two lysine residues into a similarly 
charged arginine (R) residue still results in a 
protein whose half life is a strong function of its 
amino- terminal residue (structures II-IV in Figure 
05 8), the conversion of bot h lysine residues into 
arginine residues results in a long-lived test 
protein whose- half -life-- is essentially insensisitive 
to the nature of its amino - terminal residue 
(Structure V in Figure 8). At the same time, lysine 
10 residues are the only amino acid residues in pro- 
teins that can be pos ttranslationally joined to the 
carboxy- terminus of ubiquitin, with the formation of 
branched ubiquitin-protein conjugates. Strikingly, 
our direct determination of the positions of 
15 ubiquitin moieties in multiply ub iqui tinated , 

short-lived proteins of the type shown in Figure 8 
has shown that all of the multiple ubiquitin 
moieties attached to a given molecule of the test 
protein reside in branched Ub-Ub structures attached 
20 to one of the two lysine residue that have been 
identified above by genetic methods as being es- 
sential components of the complete amino - terminal 
degradation signal. What then distinguishes the 
above lysine residues (Structures II-IV) from the 
25 numerous other lysine residues in the rest of the 

DHFR test protein? A clue to the unique role of the 
lysine residues as the second determinant of the 
amino- terminal degradation signal is provided by the 
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fact that, due to the design of the original ex- 
pression vector used in our work (see Figure 1), our 
0gal test proteins bear a 45-residue amino - terminal 
extension derived from an internal sequence of the 

05 lac repressor encoded by the lad gene. Thus the 
"0gal-derived n amino - terminal extension discussed 
above (Figures 7 and 8) is derived not from the 
amino- terminal sequence of the wild-type 0gal but 
from an unrelated sequence present at the amino- 

10 termini of our £gal test proteins. It is likely 
that the lac repres sor - spec if ic extension at the 
amino- termini of these 0gals is more disordered 
(segmentally mobile) than the amino - te rmina 1 region 
of the wild-type 0gal. If so, this extension, while 

15 not metabolically destabilizing £gal as such, could 
allow the observed extreme dependence of the 0gal's 
half-life on the nature of its amino - terminal 
residue (Table 1), and thereby, in hindsight, could 
have greatly facilitated the discovery of the N-end 

20 rule. The disordered (segmentally mobile) state of 
the 0gal extension provides an explanation for the 
unique nature of lysine residues within the ex- 
tension versus the lysine residues in the spatially 
ordered DHFR portion of the test protein (Figure 8). 

25 Thus, the simplest interpretation of this and 

related evidence is that the complete amino - terminal 
degradation signal comprises not one but two dis- 
tinct determinants, each of which is necessary but 
by itself not sufficient to render the protein 
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metabolically unstable. One determinant, described 
in the first part of this application, is the 
protein's amino - terminal residue. The second 
determinant, described immediately above, is a 

05 specific internal lysine residue. As indicated by 
the data of Figure 8 and the considerations above, 
the ability of this critical lysine residue- to serve 
as the second determinant is to a significant degree 
independent of the unique amino acid sequences sur- 

10 rounding the lysine residue . Instead, an essential 
feature of the critical lysine residue includes its 
spatial proximity to the protein's amino - terminus . 

Eguiya-lents 

Those skilled in the art will recognize, or be 
15 able to ascertain using no more than routine experi- 
mentation, many equivalents to the specific embodi- 
ments of the invention described herein. Such 
equivalents are intended to be encompassed by che 
following claims. 
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CLAIMS 



1. A method of regulating the metabolic stability 
of an intracellular protein, comprising 
producing a protein in vivo having a 

05 predetermined amino acid at the amino terminus, 

other than methionine, of the stabilizing or 
destabilizing class according to the N-end rule 
of protein degradation. 

2. A method of me tabolically destabilizing an 

10 intracellular protein comprising producing the 

orotein in vivo having a predetermined 
amino - terminal amino acid, other than 
methionine, of the destabilizing class. 

3. A method of Claim 2, wherein an amino acid of 
15 the destabilizing class is selected from the 

group consisting of Isoleucine, Glutamic acid, 
Tyrosine, Glutamine, Phenylalanine, Leucine, 
Aspartic Acid, Lysine or Arginine . 

4. A method of Claim 2, wherein an amino acid of 
20 the destabilizing class in a mammalian cell is 

selected from the group consisting of 
Arginine, Lysine, Histidine, Phenylalanine, 
Leucine, Tryptophan, Alanine, Serine, 
Threionine, Aspartic Acid, Glutamine, Glutamic 
25 Acid, Cysteine or Asparagine. 
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5. A method of Claim 3, wherein the amino - terminal 
region of the protein comprises a structure 
having one or more lysine residues in suf- 
ficient spatial proximity to the amino - terminus 

05 of the protein to provide for the second 

determinant of the- complete amino - terminal 
degradation signal according to the N-end rule 
of protein degradation. 

6. A method of producing a protein which is 
10 metabolioally more or less stable than its 

wild-type counterpart, comprising introducing, 
in vivo at the amino - terminus of the wild-type 
protein a predetermined amino acid, other than 
methionine, of the stabilizing or destabilizing 
15 class according to the N-end rule of protein 

degradation to produce a modified protein 
having a stabilizing or destabilizing amino 
acid at its amino - terminus . 

7. A method of Claim 6, wherein the protein is 
20 produced as a fusion protein containing a 

masking protein j oined to its amino - terminus , 
the masking protein being pro teoly tically 
cleavable at the junction with the 
amino - terminus of the modified protein. 



25 8 . 



A method of Claim 7, wherein the masking 
protein is ubiquitin. 
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9. A method of Claim 8, wherein ubiquitin- specif ic 
protease is used to cleave ubiquitin from the 
fusion protein at the ub iqui t in - pr o t e in junc- 
tion either iri vivo or in vitro . 

05 10. A method of Claim 7, wherein the masking 

protein- is joined to the amino - terminus of trhe 
protein through an amino acid sequence 
comprising a site for recognition by an endo- 
pro tease . 

10 11. A method of Claim 10, wherein the protein is 
produced in vivo and subsequently cleaved in 
vitro by endoprotease the factor X^. 

12. A method of producing a metabolically unstable 
protein comprising producing the protein as a 

15 fusion protein wherein ubiquitin is joined to 

the amino - terminus of the protein in a manner 
such that the fusion protein cannot be ef- 
ficiently deub iqui t ina ted . 

13. A method of Claim 12, wherein the fusion 

20 protein comprises ub iqui t in - pr o 1 ine - p r o te in . 

14. A method of Claim 12, wherein the fusion 
protein contains ubiquitin having a modifica- 
tion in the amino acid sequence recognized by 
ubiquitin- specif ic processing protease so as to 
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reduce the efficiency of deub iqui tination of 
the fusion protein by the protease. 

15. A gene construct encoding a fusion protein, the 
gene construct comprising a DNA sequence 

05 encoding. a masking protein joined to a DNA 

sequence, encoding a prote.in of interest having 
a predetermined amino acid of the metabolically 
stabilizing or destabilizing class at its 
amino - terminus , the masking protein being 

10 pro teolytically cleavable at the junction with 

the amino - terminus of the protein of interest 
such that cleavage results in the exposure of 
the amino - terminus of the protein of interest. 

16. A gene construct of Claim 15, wherein the DNA 
15 sequence encoding the protein of interest 

contains a DNA sequence in the 5' region 
encoding a structure having one or more lysine 
residues, the DNA sequence being located such 
that the lysine residues are in sufficient 
20 spatial proximity to the amino - terminus to 

provide for the second determinant of the 
complete amino - terminal degradation signal. 



17. A method of Claim 15, wherein the masking 
protein is ubiquitin. 
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18. A method of producing a protein having a 
predetermined amino - terminal amino acid, com- 
prising expressing the protein as a fusion 
protein wherein the amino - terminus of the 

05 protein is fused to a masking protein which is 

specifically proteoly tically cleavable in vivo 
or in vitro a.t the junction with the 
amino-terminal amino acid of the protein. 

19. A method of Claim 18, wherein the masking 
10 protein is cleavable by a endoprotease . 

20. A method of Claim 19, wherein the masking 
protein is ubiquitin and the cleaving endo- 
protease is ubiquitin-specific protease. 



21 



A method of producing a protein having a 
15 predetermined amino-terminal amino acid com- 

prising : 

a. preparing a DNA construct encoding a 
fusion protein comprising: 

i) a structural gene encoding the 
protein, the gene having a sequence 
at its 5' end encoding the pre- 
determined amino-terminal amino acid 
of the protein; and 

ii) DNA encoding a masking protein linked 
to the 5' end of the structural gene 
encoding the protein, the masking 



20 



25 
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protein being specifically 
proteolytically cleavable in vivo or 
in vitro at its junction with the 
amino -terminus of the protein; and 
05 b. expressing the DSTA construct in a host 

cell to produce the fusion protein encoded 
by the DNA construct,, the fusion protein 
being specifically cleaved, proteo- 
lytically, to provide a protein having the 
10 predetermined amino - terminal amino acid. 

22. A method of Claim 21, wherein the fusion 

protein is cleaved within the host cell to 
release the protein having the predetermined 
amino- terminal structure . 

A method of Claim 21, wherein the fusion 
protein is produced in a cell which lacks an 
enzyme which pro teoly tically cleaves the 
masking protein and the fusion protein is 
subsequently pro teo ly tically cleaved in vitro 
to release the protein having the predetermined 
amino -terminal structure . 

24. A method of Claim 21, wherein the masking 
protein is ubiquitin. 



15 23. 
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25. A DNA construct for producing a protein having 
a predetermined amino - terminal amino acid 
comprising: 

a. a structural gene encoding the protein, 
05 the gene having a sequence at its 5' end 

encoding_the predetermined amino - terminal 
structure of the protein; and 

b. DNA encoding a masking protein linked to 
the 5' end of the structural gene encoding 

10 the protein, the masking protein being 

specifically pro teoly tically cleavable in 
vivo or in vitro at its junction with the 
amino - terminus of the protein. 

26. A method of Claim 25, wherein the masking 
15 protein is ubiquitin. 

27. Isolated ubiquitin- specif ic protease capable of 
cleaving ubiquitin from the amino - te rminus of a 
protein to which it is joined at the junction 
between ubiquitin and the amino - te rminus of the 

20 protein. 

28. An isolated DNA sequence encoding ubiquitin- 
specific protease of Claim 27. 



29. A mutant cell line which is conditionally or 
unconditionally incapable of synthesizing one 
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or more proteases involved in the N-end rule 
degradative pathway . 

30. A DNA, construct for insertion into the 5' 
region of a structural gene, comprising a 

05 sequence which- encodes a polypeptide structure 

containing, one. or more lysine- residues, the. 
codons for the lysine residues being positioned 
within the construct such that when the 
construct is inserted into the structural gene, 

10 the lysine residues are sufficiently spatially 

proximate to the amino - terminus of the encoded 
protein to serve as the second determinant of 
the complete amino - terminal degradation signal. 
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